Docker image #1293
Comments
+1

Mistral created a Dockerfile here:

CUDA-based image is too fat and useless; just use a slim Python image.
What are your GPUs?
On Sun, Oct 8, 2023, Alexey Rogov wrote:

> CUDA-based image is too fat and useless; just use a slim Python image.
>
> I'm using this Dockerfile to run Mistral on 2 GPUs:
```dockerfile
FROM python:3.11-slim

ENV DEBIAN_FRONTEND=noninteractive

RUN pip install --upgrade pip && \
    pip install --upgrade ray && \
    pip install --upgrade pyarrow && \
    pip install pandas fschat==0.2.23 && \
    pip install --upgrade vllm

RUN apt-get update && apt-get install git -y

RUN pip install git+https://github.com/huggingface/transformers.git

EXPOSE 8080 6379

CMD echo "Y" | ray start --head && sleep 5 && ray status && \
    python -m vllm.entrypoints.openai.api_server \
        --served-model $MODEL_ID \
        --model $MODEL_ID \
        --tensor-parallel-size 2 \
        --worker-use-ray \
        --host 0.0.0.0 \
        --port 8080 \
        --gpu-memory-utilization 0.45 \
        --max-num-batched-tokens 32768
```
```bash
docker run -d --gpus all -it --ipc=host --shm-size 10g \
    -e MODEL_ID=$model \
    -p 8080:8080 -p 6379:6379 \
    -v $volume:/root/.cache/huggingface/hub/ \
    morgulio/vllm:0.2.0
```
Before starting, set the shell variables `model` and `volume` as needed.
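Once the container is up, the OpenAI-compatible endpoint can be smoke-tested with a request like the one below (a minimal sketch; the `model` field must match whatever you passed as `MODEL_ID`, shown here with a hypothetical Mistral ID):

```bash
# Hypothetical smoke test against the server started above.
# "mistralai/Mistral-7B-v0.1" is a placeholder for your $MODEL_ID.
curl http://localhost:8080/v1/completions \
    -H "Content-Type: application/json" \
    -d '{
          "model": "mistralai/Mistral-7B-v0.1",
          "prompt": "Hello, my name is",
          "max_tokens": 32
        }'
```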
Is there a Dockerfile anywhere that successfully builds vLLM?
@sureshbhusare RTX 3090 FE
This does not work; I get an Nvidia driver error.
@sureshbhusare do you have CUDA and the NVIDIA Container Toolkit installed on the host?
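For anyone hitting the same driver error: a quick way to confirm the host can expose GPUs to containers at all (assuming the NVIDIA Container Toolkit is installed) is to run `nvidia-smi` inside a bare CUDA image:

```bash
# Should print the same GPU table as nvidia-smi on the host.
# A failure here points at the driver or container-toolkit setup,
# not at the vLLM image itself.
docker run --rm --gpus all nvidia/cuda:11.8.0-base-ubuntu22.04 nvidia-smi
```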
I had a working Dockerfile but it's now broken; a recent commit is causing a CUDA mismatch. My image uses a CUDA 11.8 base, and the build now fails with `The detected CUDA version (11.8) mismatches the version that was used to compile PyTorch`, so something is causing PyTorch compiled against 12.1 to be installed. The Dockerfile is only two lines, starting from `runpod/pytorch:2.0.1-py3.10-cuda11.8.0-devel`. vLLM is supposed to require CUDA 11.8 as per the docs; has this changed?
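One possible workaround for this kind of mismatch (a sketch only, not a confirmed fix for this report) is to pin PyTorch to the cu118 wheel index before installing vLLM, so pip cannot swap in a build compiled against CUDA 12.1:

```dockerfile
FROM runpod/pytorch:2.0.1-py3.10-cuda11.8.0-devel

# Pin torch to the CUDA 11.8 wheels so a later dependency
# cannot silently replace it with a CUDA 12.1 build.
RUN pip install torch==2.0.1 --index-url https://download.pytorch.org/whl/cu118 && \
    pip install vllm
```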
@olihough86 Just built your Dockerfile and it works for me. UPDATE: it fails at the …
My site prefers Nvidia's conda channel for CUDA over the NVCR images; our vLLM Dockerfile is available at https://github.com/ucsd-ets/traip-vllm if anybody's interested in that approach.
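For context, the conda-channel approach looks roughly like this (a sketch with assumed package names and versions; see the linked repo for the exact pins):

```dockerfile
FROM condaforge/miniforge3:latest

# Install the CUDA toolkit from NVIDIA's conda channel instead of
# starting from an NVCR (nvcr.io) base image.
RUN conda install -y -c nvidia cuda-toolkit=11.8 && \
    pip install vllm
```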
Can this Dockerfile be built on any PC (I am on a MacBook) and pushed to a registry? @agt I'm having issues building locally; the idea is to push to ECR and then run it via a Kubernetes deployment, but I'm getting this error:
Is it also possible for you to put it on Docker Hub, to avoid building locally? @agt
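On the cross-building question: an image built on an Apple Silicon MacBook defaults to linux/arm64, which won't run on a typical x86 Kubernetes node. A minimal sketch of an explicit cross-platform build and push (the ECR path is a placeholder):

```bash
# Cross-build for linux/amd64 from a Mac and push straight to ECR.
# <account>.dkr.ecr.<region>.amazonaws.com/vllm is a placeholder path.
docker buildx build --platform linux/amd64 \
    -t <account>.dkr.ecr.<region>.amazonaws.com/vllm:latest \
    --push .
```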
Any Dockerfile? Or any official Docker image?