
Train multiple models on a single GPU in parallel #10204

Closed
Programmer-RD-AI opened this issue Oct 28, 2021 Discussed in #10159 · 4 comments

Comments

@Programmer-RD-AI
Contributor

Discussed in #10159

Originally posted by grudloff October 27, 2021
Is there a recommended way of training multiple models in parallel in a single GPU? I tried using joblib's Parallel & delayed but I got a CUDA OOM with two instances even though a single model uses barely a fourth of the total memory. And is a speedup compared to sequential calling expected?
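A likely cause of the OOM is that joblib's default backend spawns separate worker processes, and each process that touches the GPU allocates its own CUDA context (several hundred MB) on top of the model itself. A minimal sketch of a thread-based alternative, which keeps all workers in one process so they share a single CUDA context, might look like the following. `train_one_model` is a hypothetical stand-in for a real training loop; the function names and the two-worker setup are illustrative assumptions, not code from this issue:

```python
# Sketch: thread-based parallelism shares one process (and thus one CUDA
# context), unlike joblib's default process-based backend, where every
# worker pays the per-process CUDA context overhead.
from concurrent.futures import ThreadPoolExecutor


def train_one_model(model_id: int) -> str:
    # In a real script this would build the model, move it to the GPU,
    # and run its training loop. Kept as a stub here so the sketch is
    # self-contained.
    return f"model-{model_id}-done"


with ThreadPoolExecutor(max_workers=2) as pool:
    results = list(pool.map(train_one_model, range(2)))

print(results)  # ['model-0-done', 'model-1-done']
```

Note that Python threads only help here because GPU kernels release the GIL while they run; whether this yields a real speedup over sequential training depends on how well the two workloads overlap on the device.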

@Programmer-RD-AI
Contributor Author

Solution: #2807

@awaelchli
Contributor

Hey @Programmer-RD-AI
Please don't duplicate posts from the discussion forum here into GitHub issues.

GitHub issues are for:

  • Bug reports
  • Feature requests
  • Anything related to the development of Lightning

GitHub issues are not:

  • For pure question answering / implementation help
  • Broad discussions not directly related to PL development

Thanks for your understanding.
