
Add compute capability 6.x support #2635

Closed
wants to merge 11 commits

Conversation

@jasonacox (Contributor) commented Jan 28, 2024

UPDATED 23-Apr-2024

This proposal adds a script, pascal.sh, which allows users to add Pascal GPU support (e.g., GTX 1060, Tesla P100) to vLLM.

The script:

  • Adds the 6.0, 6.1, and 6.2 (compute capability) GPU architectures to the CMakeLists.txt and Dockerfile files (a sketch of the approach follows below)
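
For reference, here is a minimal sketch of the approach. The exact strings it rewrites (the semicolon-separated arch list in CMakeLists.txt and the torch_cuda_arch_list build argument in the Dockerfile) are assumptions about the build files at the time, not the verbatim contents of pascal.sh:

#!/bin/bash
# pascal.sh (sketch) - prepend Pascal (6.0, 6.1, 6.2) to the CUDA arch lists.
# The sed patterns below are illustrative and must match the current upstream lists.
set -e

# CMakeLists.txt: extend the semicolon-separated supported-arch list
sed -i 's/"7\.0;7\.5;8\.0/"6.0;6.1;6.2;7.0;7.5;8.0/' CMakeLists.txt

# Dockerfile: extend the space-separated torch_cuda_arch_list build argument
sed -i 's/7\.0 7\.5 8\.0/6.0 6.1 6.2 7.0 7.5 8.0/' Dockerfile

echo "Added compute capability 6.x (Pascal) to CMakeLists.txt and Dockerfile."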

Run:

# Add Pascal Support
./pascal.sh

# You can now build from source with Pascal GPU support:
pip install -e .

# or build the Docker image with:
DOCKER_BUILDKIT=1 docker build . --target vllm-openai --tag vllm/vllm-openai
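
Before building, it may be worth confirming that the card really is Pascal; this check just uses PyTorch's standard CUDA API:

# Print the compute capability of GPU 0; expect (6, 0), (6, 1), or (6, 2) for Pascal
python3 -c "import torch; print(torch.cuda.get_device_capability(0))"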

Notes:

  • Pascal architectures are still supported by the latest CUDA releases.
  • I understand that adding 6.x support would push the wheel size beyond the limit, so this script gives users of this older architecture a way to build vLLM themselves.

Build and run example:

# Build From Source
git clone https://github.com/vllm-project/vllm.git
cd vllm
./pascal.sh
pip install -e .

# Run OpenAI API Compatible Server
python3 -m vllm.entrypoints.openai.api_server \
    --tensor-parallel-size 4 \
    --worker-use-ray \
    --host 0.0.0.0 \
    --port 8080  \
    --model mistralai/Mistral-7B-Instruct-v0.1 \
    --served-model-name mistralai/Mistral-7B-Instruct-v0.1 \
    --dtype float \
    --max-model-len 20000
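
Once the server is up, a quick smoke test against the OpenAI-compatible completions endpoint (the port matches the command above):

# Send a small completion request to the local server
curl http://localhost:8080/v1/completions \
    -H "Content-Type: application/json" \
    -d '{
          "model": "mistralai/Mistral-7B-Instruct-v0.1",
          "prompt": "Hello from a Pascal GPU:",
          "max_tokens": 32
        }'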

Related: #963 #1284

Thank you for the great project!!! 🙏

This adds the 6.x architectures to the supported list but also presents a warning that capabilities < 7.0 are untested and may have issues.
The build failed the yapf check; updating to the suggested format:
NVIDIA_SUPPORTED_ARCHS = {
    "6.0", "6.1", "6.2", "7.0", "7.5", "8.0", "8.6", "8.9", "9.0"
}
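
For anyone hitting the same CI failure, the formatting can be checked locally with yapf before pushing; the file path below is a placeholder for whichever file you edited:

# Show the formatting changes yapf would make (no files are modified with --diff)
pip install yapf
yapf --diff vllm/config.py  # placeholder path; substitute the file that failed the check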

@cduk left a comment


Why do you delete some of the ROCM cards from the supported list?

@jasonacox (Contributor, Author)

> Why do you delete some of the ROCM cards from the supported list?

Hi @cduk - This PR doesn't touch the ROCM cards. It only adds the Nvidia Pascal (capability 6) cards. Here are the diffs:

[screenshots of the two diffs]

@cduk commented Mar 18, 2024

Maybe I'm mistaken. I was looking at this diff:

[screenshot of the diff]

@Fuckingnameless

What's the status on this?

@nkuhn-vmw

Hello, I'm very interested in this PR, as I'm also running multiple P40s and would like to use vLLM.

@youkaichao (Member)

Thanks for the effort. You can keep this branch, but this PR is not necessary. Closed.

@youkaichao closed this Apr 24, 2024
@jasonacox (Contributor, Author)

No problem. Thanks @youkaichao

The permanent solution would be #4290.
