-
Notifications
You must be signed in to change notification settings - Fork 356
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[Bug] config to import yapf causes 'EOFError: Ran out of input' when distributed training #1480
Comments
I think it's a yapf problem, the repo of yapf also reported this link |
I am getting this as well |
In my case, setup is: "mmengine==0.10.2" on ubuntu |
Same problem using the older mmcv-full (1.3.0) and mmseg=0.11.0. Small update: pip install yapf=0.32 fixes this issue for me. |
I used a import time
import os
def wait_before_import_config():
t = int(os.environ.get('LOCAL_RANK', 0))
time.sleep(t * 0.5)
def wait_after_import_config():
t = int(os.environ.get('WORLD_SIZE', 0)) - int(os.environ.get('LOCAL_RANK', 0))
time.sleep(t * 0.5)
wait_before_import_config()
from mmengine.config import Config
wait_after_import_config() |
This has been happening a lot for us. On older versions of mmcv. |
@DeclK you are a hero! |
pls check this issue,may be helpful for you:google/yapf#1204 |
Similar problem i meet when using mmengine,how i find a way to fix in this issue:google/yapf#1204 |
Prerequisite
Environment
None
Reproduces the problem - code sample
None
Reproduces the problem - command or script
Just run
torchrun --nproc_per_node 8 mmengine_train.py config.py --launcher pytorch
Reproduces the problem - error message
Additional information
No response
The text was updated successfully, but these errors were encountered: