-
Notifications
You must be signed in to change notification settings - Fork 664
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
MVDR module can't work in distributed training #1991
Comments
This makes me wonder ... Should we keep using pseudo complex type? (#1337) Or can |
I tested |
The PyTorch that works is built from source. Let me try with a stable version to see how it is like. |
I think this issue is solved in latest PyTorch. Using After discussion with @fmassa, I'm thinking adding |
Close it since MVDR works in latest PyTorch in distributed training. |
🐛 Describe the bug
NCCL currently only supports floating point and integer dtype. (See pytorch/pytorch#45760)
The MVDR module forces to use
cdouble
dtype to improve the numerical robustness, however, it will cause an runtime error when integrating it to a neural network:To solve it, the temporary workaround is relax the dtype constraint by allowing MVDR to use
cfloat
dtype.Versions
The issue is general to the current PyTorch and torchaudio version.
The text was updated successfully, but these errors were encountered: