This gives a runtime error when constructing the DDP wrapper:
Traceback (most recent call last):
  File "./train.py", line 256, in <module>
    main(pargs)
  File "./train.py", line 167, in main
    ddp_net = DDP(net, device_ids=[device.index],
  File "/opt/conda/lib/python3.8/site-packages/torch/nn/parallel/distributed.py", line 551, in __init__
    self.process_group = _get_default_group()
  File "/opt/conda/lib/python3.8/site-packages/torch/distributed/distributed_c10d.py", line 412, in _get_default_group
    raise RuntimeError(
RuntimeError: Default process group has not been initialized, please make sure to call init_process_group.
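For context, the error means `DDP` was constructed before `torch.distributed.init_process_group` had been called. Below is a minimal sketch of one possible workaround, assuming the run is meant to be single-process. The helper names `ensure_process_group` and `wrap_single_process`, the `gloo` backend, and the file-based rendezvous are illustrative assumptions, not how deepcam's wireup code is actually written.

```python
import os
import tempfile

import torch.distributed as dist
import torch.nn as nn
from torch.nn.parallel import DistributedDataParallel as DDP


def ensure_process_group(backend="gloo"):
    """Create a one-rank default process group if none exists yet.

    Hypothetical helper: the backend and the file-based rendezvous are
    assumptions chosen so the sketch runs on a single CPU-only process.
    """
    if dist.is_initialized():
        return
    # Rendezvous through a fresh file in a temporary directory, so no
    # network port or environment variables are needed.
    store_path = os.path.join(tempfile.mkdtemp(), "ddp_store")
    dist.init_process_group(
        backend=backend,
        init_method="file://" + store_path,
        rank=0,
        world_size=1,
    )


def wrap_single_process(net: nn.Module) -> DDP:
    """Wrap a CPU module in DDP, initializing the default group first."""
    ensure_process_group()
    return DDP(net)
```

With something like this in place, `wrap_single_process(nn.Linear(4, 2))` should construct the wrapper without hitting `_get_default_group()` on an uninitialized group. A GPU run would presumably still pass `device_ids` as in `train.py`.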
It's probably not a common use-case, but the "dummy" wireup method for deepcam doesn't seem to work.
Here's an example script at NERSC: