Skip to content

ImportError: cannot import name 'notf' from 'tensorboard.compat'  #10

@Alishah-Naushad

Description

@Alishah-Naushad

I am using Ubuntu 22.04 and have setup the conda python environment as mentioned in the Readme.md file. But I am facing issues while running the code.

Initial Issue:
Traceback (most recent call last):
 File "train.py", line 9, in
  from torch.utils.tensorboard import SummaryWriter
 File "/home/alishah/miniconda3/envs/carl/lib/python3.7/site-packages/torch/utils/tensorboard/init.py", line 8, in
  from .writer import FileWriter, SummaryWriter # noqa F401
 File "/home/alishah/miniconda3/envs/carl/lib/python3.7/site-packages/torch/utils/tensorboard/writer.py", line 14, in
  from tensorboard.compat.proto.event_pb2 import SessionLog
 File "/home/alishah/miniconda3/envs/carl/lib/python3.7/site-packages/tensorboard/compat/proto/event_pb2.py", line 17, in
  from tensorboard.compat.proto import summary_pb2 as tensorboard_dot_compat_dot_proto_dot_summary__pb2
 File "/home/alishah/miniconda3/envs/carl/lib/python3.7/site-packages/tensorboard/compat/proto/summary_pb2.py", line 17, in
  from tensorboard.compat.proto import tensor_pb2 as tensorboard_dot_compat_dot_proto_dot_tensor__pb2
 File "/home/alishah/miniconda3/envs/carl/lib/python3.7/site-packages/tensorboard/compat/proto/tensor_pb2.py", line 16, in
  from tensorboard.compat.proto import resource_handle_pb2 as tensorboard_dot_compat_dot_proto_dot_resource__handle__pb2
 File "/home/alishah/miniconda3/envs/carl/lib/python3.7/site-packages/tensorboard/compat/proto/resource_handle_pb2.py", line 16, in
  from tensorboard.compat.proto import tensor_shape_pb2 as tensorboard_dot_compat_dot_proto_dot_tensor__shape__pb2
 File "/home/alishah/miniconda3/envs/carl/lib/python3.7/site-packages/tensorboard/compat/proto/tensor_shape_pb2.py", line 42, in
  serialized_options=None, file=DESCRIPTOR),
 File "/home/alishah/miniconda3/envs/carl/lib/python3.7/site-packages/google/protobuf/descriptor.py", line 561, in new
  _message.Message._CheckCalledFromGeneratedFile()
TypeError: Descriptors cannot not be created directly.
If this call came from a _pb2.py file, your generated code is out of date and must be regenerated with protoc >= 3.19.0.
If you cannot immediately regenerate your protos, some other possible workarounds are:
 1. Downgrade the protobuf package to 3.20.x or lower.
 2. Set PROTOCOL_BUFFERS_PYTHON_IMPLEMENTATION=python (but this will use pure-Python parsing and will be much slower).

More information: https://developers.google.com/protocol-buffers/docs/news/2022-05-06#python-updates
Traceback (most recent call last):
 File "/home/alishah/miniconda3/envs/carl/lib/python3.7/runpy.py", line 193, in _run_module_as_main
  "main", mod_spec)
 File "/home/alishah/miniconda3/envs/carl/lib/python3.7/runpy.py", line 85, in _run_code
  exec(code, run_globals)
 File "/home/alishah/miniconda3/envs/carl/lib/python3.7/site-packages/torch/distributed/launch.py", line 261, in
  main()
 File "/home/alishah/miniconda3/envs/carl/lib/python3.7/site-packages/torch/distributed/launch.py", line 257, in main
  cmd=cmd)
subprocess.CalledProcessError: Command '['/home/alishah/miniconda3/envs/carl/bin/python', '-u', 'train.py', '--local_rank=0', '--workdir', '/home/alishah/datasets', '--cfg_file', './configs/scl_transformer_config.yml', '--logdir', '/home/alishah/tmp/scl_transformer_logs']' returned non-zero exit status 1.
expand_more

What Did I do:
I downgrade the protobuf to 3.20.x as mentioned in the error, after that I again run the code and it caused another issue.

Second Issue:
carl) ali@Ali:~/CARL_code$ python -m torch.distributed.launch --nproc_per_node 1 train.py --workdir ~/datasets --cfg_file ./configs/scl_transformer_config.yml --logdir ~/tmp/scl_transformer_logs
2024-06-26 16:39:38.863368: I tensorflow/core/platform/cpu_feature_guard.cc:193] This TensorFlow binary is optimized with oneAPI Deep Neural Network Library (oneDNN) to use the following CPU instructions in performance-critical operations: AVX2 AVX_VNNI FMA
To enable them in other operations, rebuild TensorFlow with the appropriate compiler flags.
2024-06-26 16:39:39.556099: I tensorflow/core/util/port.cc:104] oneDNN custom operations are on. You may see slightly different numerical results due to floating-point round-off errors from different computation orders. To turn them off, set the environment variable TF_ENABLE_ONEDNN_OPTS=0.
2024-06-26 16:39:39.699120: W tensorflow/compiler/xla/stream_executor/platform/default/dso_loader.cc:64] Could not load dynamic library 'libcudart.so.11.0'; dlerror: libcudart.so.11.0: cannot open shared object file: No such file or directory
2024-06-26 16:39:39.699209: I tensorflow/compiler/xla/stream_executor/cuda/cudart_stub.cc:29] Ignore above cudart dlerror if you do not have a GPU set up on your machine.
2024-06-26 16:39:41.603680: W tensorflow/compiler/xla/stream_executor/platform/default/dso_loader.cc:64] Could not load dynamic library 'libnvinfer.so.7'; dlerror: libnvinfer.so.7: cannot open shared object file: No such file or directory
2024-06-26 16:39:41.603818: W tensorflow/compiler/xla/stream_executor/platform/default/dso_loader.cc:64] Could not load dynamic library 'libnvinfer_plugin.so.7'; dlerror: libnvinfer_plugin.so.7: cannot open shared object file: No such file or directory
2024-06-26 16:39:41.603849: W tensorflow/compiler/tf2tensorrt/utils/py_utils.cc:38] TF-TRT Warning: Cannot dlopen some TensorRT libraries. If you would like to use Nvidia GPU with TensorRT, please make sure the missing libraries mentioned above are installed properly.
Traceback (most recent call last):
File "/home/ali/miniconda3/envs/carl/lib/python3.7/site-packages/tensorboard/compat/init.py", line 42, in tf
from tensorboard.compat import notf # noqa: F401
ImportError: cannot import name 'notf' from 'tensorboard.compat' (/home/ali/miniconda3/envs/carl/lib/python3.7/site-packages/tensorboard/compat/init.py)

During handling of the above exception, another exception occurred:

Traceback (most recent call last):
File "train.py", line 207, in
main()
File "train.py", line 156, in main
summary_writer = SummaryWriter(os.path.join(cfg.LOGDIR, 'train_logs'))
File "/home/ali/miniconda3/envs/carl/lib/python3.7/site-packages/torch/utils/tensorboard/writer.py", line 225, in init
self._get_file_writer()
File "/home/ali/miniconda3/envs/carl/lib/python3.7/site-packages/torch/utils/tensorboard/writer.py", line 256, in _get_file_writer
self.flush_secs, self.filename_suffix)
File "/home/ali/miniconda3/envs/carl/lib/python3.7/site-packages/torch/utils/tensorboard/writer.py", line 66, in init
log_dir, max_queue, flush_secs, filename_suffix)
File "/home/ali/miniconda3/envs/carl/lib/python3.7/site-packages/tensorboard/summary/writer/event_file_writer.py", line 72, in init
tf.io.gfile.makedirs(logdir)
File "/home/ali/miniconda3/envs/carl/lib/python3.7/site-packages/tensorboard/lazy.py", line 65, in getattr
return getattr(load_once(self), attr_name)
File "/home/ali/miniconda3/envs/carl/lib/python3.7/site-packages/tensorboard/lazy.py", line 97, in wrapper
cache[arg] = f(arg)
File "/home/ali/miniconda3/envs/carl/lib/python3.7/site-packages/tensorboard/lazy.py", line 50, in load_once
module = load_fn()
File "/home/ali/miniconda3/envs/carl/lib/python3.7/site-packages/tensorboard/compat/init.py", line 45, in tf
import tensorflow
File "/home/ali/miniconda3/envs/carl/lib/python3.7/site-packages/tensorflow/init.py", line 37, in
from tensorflow.python.tools import module_util as _module_util
File "/home/ali/miniconda3/envs/carl/lib/python3.7/site-packages/tensorflow/python/init.py", line 45, in
from tensorflow.python.feature_column import feature_column_lib as feature_column
File "/home/ali/miniconda3/envs/carl/lib/python3.7/site-packages/tensorflow/python/feature_column/feature_column_lib.py", line 18, in
from tensorflow.python.feature_column.feature_column import *
File "/home/ali/miniconda3/envs/carl/lib/python3.7/site-packages/tensorflow/python/feature_column/feature_column.py", line 143, in
from tensorflow.python.layers import base
File "/home/ali/miniconda3/envs/carl/lib/python3.7/site-packages/tensorflow/python/layers/base.py", line 16, in
from tensorflow.python.keras.legacy_tf_layers import base
File "/home/ali/miniconda3/envs/carl/lib/python3.7/site-packages/tensorflow/python/keras/init.py", line 25, in
from tensorflow.python.keras import models
File "/home/ali/miniconda3/envs/carl/lib/python3.7/site-packages/tensorflow/python/keras/models.py", line 22, in
from tensorflow.python.keras.engine import functional
File "/home/ali/miniconda3/envs/carl/lib/python3.7/site-packages/tensorflow/python/keras/engine/functional.py", line 32, in
from tensorflow.python.keras.engine import training as training_lib
File "/home/ali/miniconda3/envs/carl/lib/python3.7/site-packages/tensorflow/python/keras/engine/training.py", line 44, in
from tensorflow.python.keras import callbacks as callbacks_module
File "/home/ali/miniconda3/envs/carl/lib/python3.7/site-packages/tensorflow/python/keras/callbacks.py", line 68, in
import requests
File "/home/ali/miniconda3/envs/carl/lib/python3.7/site-packages/requests/init.py", line 164, in
from .api import delete, get, head, options, patch, post, put, request
File "/home/ali/miniconda3/envs/carl/lib/python3.7/site-packages/requests/api.py", line 11, in
from . import sessions
File "/home/ali/miniconda3/envs/carl/lib/python3.7/site-packages/requests/sessions.py", line 15, in
from .adapters import HTTPAdapter
File "/home/ali/miniconda3/envs/carl/lib/python3.7/site-packages/requests/adapters.py", line 60, in
from urllib3.contrib.socks import SOCKSProxyManager
File "/home/ali/miniconda3/envs/carl/lib/python3.7/site-packages/urllib3/contrib/socks.py", line 75, in
class _TYPE_SOCKS_OPTIONS(typing.TypedDict):
AttributeError: module 'typing' has no attribute 'TypedDict'
Traceback (most recent call last):
File "/home/ali/miniconda3/envs/carl/lib/python3.7/runpy.py", line 193, in _run_module_as_main
"main", mod_spec)
File "/home/ali/miniconda3/envs/carl/lib/python3.7/runpy.py", line 85, in _run_code
exec(code, run_globals)
File "/home/ali/miniconda3/envs/carl/lib/python3.7/site-packages/torch/distributed/launch.py", line 261, in
main()
File "/home/ali/miniconda3/envs/carl/lib/python3.7/site-packages/torch/distributed/launch.py", line 257, in main
cmd=cmd)
subprocess.CalledProcessError: Command '['/home/ali/miniconda3/envs/carl/bin/python', '-u', 'train.py', '--local_rank=0', '--workdir', '/home/ali/datasets', '--cfg_file', './configs/scl_transformer_config.yml', '--logdir', '/home/ali/tmp/scl_transformer_logs']' returned non-zero exit status 1.

Don't know why this is causing all this issues, cannot find solution on the internet

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions