[Bug]: Invalid Device Ordinal on ROCm #4131
Comments
I get the same error.
Did you solve this?
Hello Eric, I just encountered the same issue as you. Have you resolved it?
Did you solve this? Several of my machines encountered this issue at the same time.
Reinstalling the conda env can solve this.
How do I do this?
I will try this if I encounter it again.
Do you still have the issue? Please update.
Closing this issue, as it should have been resolved. Please open a new one if you run into a similar issue again.
Your current environment
🐛 Describe the bug
An "invalid device ordinal" error occurs when running with tp=2 on ROCm:
python benchmarks/benchmark_throughput.py --input-len=50 --output-len=100 --model=mistralai/Mistral-7B-v0.1 --tensor-parallel-size=2 --enforce-eager
This is with a build from the latest main. I was hoping this was fixed by #3770, but no amount of environment configuration has helped either (CUDA_VISIBLE_DEVICES, etc.).
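For anyone else hitting this, one thing worth ruling out is a mismatch between the tensor-parallel size and the number of GPUs the process can actually see, since an invalid device ordinal generally means a device index was requested that is not visible. The sketch below is only a diagnostic aid, not a fix and not part of vLLM: the helper names `masked_device_count` and `check_tensor_parallel` are hypothetical, and treating the smallest list among the visibility variables as an upper bound is a simplifying assumption about how they combine.

```python
import os

# Variables that can restrict which GPUs a process sees. CUDA_VISIBLE_DEVICES
# is the one named in the report; the HIP/ROCr variables are ROCm-side
# equivalents. Checking all three is an assumption made for this sketch.
VISIBILITY_VARS = ("CUDA_VISIBLE_DEVICES", "HIP_VISIBLE_DEVICES", "ROCR_VISIBLE_DEVICES")


def masked_device_count(env):
    """Upper bound on visible devices implied by the variables, or None if unset."""
    counts = []
    for name in VISIBILITY_VARS:
        value = env.get(name)
        if value is None:
            continue
        # Count the non-empty, comma-separated entries in the variable.
        ids = [part for part in value.split(",") if part.strip()]
        counts.append(len(ids))
    return min(counts) if counts else None


def check_tensor_parallel(tp_size, env, detected=None):
    """Return a list of human-readable problems; an empty list means none found."""
    problems = []
    masked = masked_device_count(env)
    if masked is not None and tp_size > masked:
        problems.append(
            f"tensor-parallel size {tp_size} exceeds the {masked} device(s) "
            "allowed by the visibility variables"
        )
    if detected is not None and tp_size > detected:
        problems.append(
            f"tensor-parallel size {tp_size} exceeds the {detected} device(s) "
            "reported by the runtime"
        )
    return problems


if __name__ == "__main__":
    try:
        import torch  # ROCm builds of PyTorch expose GPUs through torch.cuda

        detected = torch.cuda.device_count()
    except ImportError:
        detected = None  # PyTorch not installed; only the variables are checked
    for name in VISIBILITY_VARS:
        print(f"{name}={os.environ.get(name)!r}")
    for problem in check_tensor_parallel(2, os.environ, detected):
        print("WARNING:", problem)
```

If this prints no warnings with the same environment used for the benchmark command, the device mask is probably not the cause and the environment itself (for example, the installed PyTorch/ROCm build) is the next thing to check.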