[Core] manage nccl via a pypi package & upgrade to pt 2.2.1 #3805

Merged on Apr 4, 2024 (16 commits)

@youkaichao (Member) commented Apr 2, 2024

See #3802 for why we don't use another pypi package to manage nccl.

See NVIDIA/nccl#1234 for why we have to manage nccl when upgrading to pt 2.2.1.

Usage:

python -m vllm.tools.install_nccl --cuda 12 --nccl 2.18.3

@youkaichao changed the title from "[Core] manage nccl via a tool script" to "[Core] manage nccl via a tool script & upgrade to pt 2.2.1" on Apr 3, 2024
@youkaichao changed the title from "[Core] manage nccl via a tool script & upgrade to pt 2.2.1" to "[Core] manage nccl via a pypi package & upgrade to pt 2.2.1" on Apr 3, 2024

@WoosukKwon (Collaborator) left a comment

@youkaichao Awesome! Thanks for the great work! I didn't realize that updating PyTorch could be this challenging. I left some minor comments. Please check them out.

.github/workflows/publish.yml (resolved)
@@ -17,3 +17,4 @@ pynvml == 11.5.0
triton >= 2.1.0
outlines == 0.0.34
tiktoken == 0.6.0 # Required for DBRX tokenizer
vllm-nccl-cu12>=2.18<2.19 # for downloading nccl library
Collaborator

Can we do this?

Suggested change
- vllm-nccl-cu12>=2.18<2.19 # for downloading nccl library
+ vllm-nccl-cu12>=2.18.3<2.19 # for downloading nccl library

Member Author

Fun fact: I actually have https://pypi.org/project/vllm-nccl-cu12/2.18.1.0.1.0/, which is nccl 2.18.1. I couldn't find nccl 2.18.3 at https://developer.download.nvidia.com/compute/redist/nccl/.
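
For readers following the exchange, here is a minimal sketch of why the lower bound matters for the package version mentioned in this comment. It is an illustration only: it compares numeric release segments with zero padding, which is a simplification of full version-specifier handling, and it reads each pin in the diff as a lower bound plus an exclusive upper bound. The helper names `parse_release` and `in_range` are hypothetical and are not part of vllm or pip.

```python
def parse_release(version: str) -> tuple[int, ...]:
    """Split a purely numeric version such as '2.18.1.0.1.0' into integers."""
    return tuple(int(part) for part in version.split("."))


def in_range(version: str, lower: str, upper: str) -> bool:
    """Return True if lower <= version < upper, comparing release segments.

    Shorter versions are padded with zeros so that '2.18' compares
    like '2.18.0.0.0.0'. This is a simplified model, assumed for
    illustration, and ignores pre-releases and other suffixes.
    """
    v, lo, hi = (parse_release(s) for s in (version, lower, upper))
    width = max(len(v), len(lo), len(hi))

    def pad(segments: tuple[int, ...]) -> tuple[int, ...]:
        return segments + (0,) * (width - len(segments))

    return pad(lo) <= pad(v) < pad(hi)


# Version string taken from the PyPI URL quoted in the comment above.
published = "2.18.1.0.1.0"

# Pin as written in the diff: at least 2.18, below 2.19.
current_pin = in_range(published, "2.18", "2.19")

# Pin from the suggested change: at least 2.18.3, below 2.19.
suggested = in_range(published, "2.18.3", "2.19")

print(current_pin, suggested)
```

Under these assumptions, the 2.18.1.0.1.0 release satisfies the original range but falls below the 2.18.3 lower bound in the suggested change.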

setup.py (resolved)
vllm/model_executor/parallel_utils/pynccl.py (outdated, resolved)

@WoosukKwon (Collaborator) left a comment

LGTM! Thanks again for the great work!

@youkaichao merged commit ca81ff5 into vllm-project:main on Apr 4, 2024
35 checks passed
@youkaichao deleted the tools.nccl branch on April 4, 2024 17:26