
Is multiple GPU supported with non-persistent pipeline? #310

Closed
yaliqin opened this issue Nov 15, 2023 · 1 comment
yaliqin commented Nov 15, 2023

Hi team,

From the code, I can see that multi-GPU deployment is supported for persistent deployments with the server method. Is it also supported with the pipeline method?

Thanks

mrwyattii (Contributor) commented:

Yes, multi-GPU is supported for mii.pipeline: https://github.com/microsoft/DeepSpeed-MII#tensor-parallelism

You will need to launch the script with the DeepSpeed launcher to enable model parallelism with the pipeline:

deepspeed --num_gpus 2 pipeline.py

Where pipeline.py could be:

import mii

# The DeepSpeed launcher starts one process per GPU; MII shards the
# model across them with tensor parallelism.
pipe = mii.pipeline("mistralai/Mistral-7B-v0.1")
response = pipe("DeepSpeed is", max_new_tokens=128)
print(response)
