Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

vLLM: RayWorkerVllm.execute_method failed with error 'TCPStore is not available' #878

Closed
nkwangleiGIT opened this issue Mar 18, 2024 · 2 comments
Assignees

Comments

@nkwangleiGIT
Copy link
Contributor

nkwangleiGIT commented Mar 18, 2024

Ray v2.9.3
vLLM 0.3.3

[RayWorkerVllm.execute_method]
failed with callstack below:

Error Type: TASK_EXECUTION_EXCEPTION

home/ray/anaconda3/lib/python3.9/site-packages/cupyx/distributed/_nccl_comm.py", line 97, in _init_with_tcp_store
self._store_proxy.barrier()
File "/home/ray/anaconda3/lib/python3.9/site-packages/cupyx/distributed/_store.py", line 152, in barrier
self._send_recv(_store_actions.Barrier())
File "/home/ray/anaconda3/lib/python3.9/site-packages/cupyx/distributed/_store.py", line 142, in _send_recv
raise RuntimeError('TCPStore is not available')
RuntimeError: TCPStore is not available

Related issues on Ray and vLLM:
ray-project/ray#43756
vllm-project/vllm#3334

image

@nkwangleiGIT nkwangleiGIT changed the title RayWorkerVllm.execute_method failed with error 'TCPStore is not available' vLLM: RayWorkerVllm.execute_method failed with error 'TCPStore is not available' Mar 18, 2024
@nkwangleiGIT
Copy link
Contributor Author

add --enforce-eager to EXTRA_ARGS to disable cupy, and vllm mode can work normally.

@bjwswang
Copy link
Collaborator

should be fixed by #884

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants