Support for NVIDIA GPUs under Docker Compose #6691

collabnix · 2019-05-09T18:28:22Z

Under Docker 19.03.0 Beta 2, support for NVIDIA GPU has been introduced in the form of new CLI API --gpus. docker/cli#1714 talk about this enablement.

Now one can simply pass --gpus option for GPU-accelerated Docker based application.

$ docker run -it --rm --gpus all ubuntu nvidia-smi
Unable to find image 'ubuntu:latest' locally
latest: Pulling from library/ubuntu
f476d66f5408: Pull complete 
8882c27f669e: Pull complete 
d9af21273955: Pull complete 
f5029279ec12: Pull complete 
Digest: sha256:d26d529daa4d8567167181d9d569f2a85da3c5ecaf539cace2c6223355d69981
Status: Downloaded newer image for ubuntu:latest
Tue May  7 15:52:15 2019       
+-----------------------------------------------------------------------------+
| NVIDIA-SMI 390.116                Driver Version: 390.116                   |
|-------------------------------+----------------------+----------------------+
| GPU  Name        Persistence-M| Bus-Id        Disp.A | Volatile Uncorr. ECC |
| Fan  Temp  Perf  Pwr:Usage/Cap|         Memory-Usage | GPU-Util  Compute M. |
|===============================+======================+======================|
|   0  Tesla P4            Off  | 00000000:00:04.0 Off |                    0 |
| N/A   39C    P0    22W /  75W |      0MiB /  7611MiB |      0%      Default |
+-------------------------------+----------------------+----------------------+

+-----------------------------------------------------------------------------+
| Processes:                                                       GPU Memory |
|  GPU       PID   Type   Process name                             Usage      |
|=============================================================================|
|  No running processes found                                                 |
+-----------------------------------------------------------------------------+
:~$

As of today, Compose doesn't support this. This is a feature request for enabling Compose to support for NVIDIA GPU.

The text was updated successfully, but these errors were encountered:

qhaas · 2019-07-24T00:17:07Z

This is of increased importance now that the (now) legacy 'nvidia runtime' appears broken with Docker 19.03.0 and nvidia-container-toolkit-1.0.0-2: NVIDIA/nvidia-docker#1017

$ cat docker-compose.yml 
version: '2.3'

services:
 nvidia-smi-test:
  runtime: nvidia
  image: nvidia/cuda:9.2-runtime-centos7

$ docker-compose run nvidia-smi-test
Cannot create container for service nvidia-smi-test: Unknown runtime specified nvidia

This works: docker run --gpus all nvidia/cudagl:9.2-runtime-centos7 nvidia-smi

This does not: docker run --runtime=nvidia nvidia/cudagl:9.2-runtime-centos7 nvidia-smi

michaelnordmeyer · 2019-07-24T09:14:33Z

Any work happening on this?

I got the new Docker CE 19.03.0 on a new Ubuntu 18.04 LTS machine, have the current and matching NVIDIA Container Toolkit (née nvidia-docker2) version, but cannot use it because docker-compose.yml 3.7 doesn't support the --gpus flag.

akiross · 2019-07-24T12:13:22Z

Is there a workaround for this?

kiendang · 2019-07-28T07:59:04Z

This works: docker run --gpus all nvidia/cudagl:9.2-runtime-centos7 nvidia-smi

This does not: docker run --runtime=nvidia nvidia/cudagl:9.2-runtime-centos7 nvidia-smi

You need to have

{
    "runtimes": {
        "nvidia": {
            "path": "/usr/bin/nvidia-container-runtime",
            "runtimeArgs": []
        }
    }
}

in your /etc/docker/daemon.json for --runtime=nvidia to continue working. More info here.

VanDavv · 2019-08-09T19:49:08Z

ping @KlaasH @ulyssessouza @Goryudyuma @chris-crone . Any update on this?

iedmrc · 2019-08-13T14:42:54Z

It is an urgent need. Thank you for your effort!

Daniel451 · 2019-08-16T15:21:45Z

Is it intended to have user manually populate /etc/docker/daemon.json after migrating to docker >= 19.03 and removing nvidia-docker2 to use nvidia-container-toolkit instead?

It seems that this breaks a lot of installations. Especially, since --gpus is not available in compose.

andyneff · 2019-08-16T15:31:48Z

No, this is a work around for until compose does support the gpus flag.

uderik · 2019-08-27T10:42:32Z

install nvidia-docker-runtime:
https://github.com/NVIDIA/nvidia-container-runtime#docker-engine-setup
add to /etc/docker/daemon.json
{
"runtimes": {
"nvidia": {
"path": "/usr/bin/nvidia-container-runtime",
"runtimeArgs": []
}
}
}

docker-compose:
runtime: nvidia
environment:
- NVIDIA_VISIBLE_DEVICES=all

Kwull · 2019-08-27T14:45:02Z

There is no such thing like "/usr/bin/nvidia-container-runtime" anymore. Issue is still critical.

uderik · 2019-08-27T14:49:58Z

it will help run nvidia environment with docker-compose, untill fix docker-compose

cheperuiz · 2019-08-27T17:35:24Z

install nvidia-docker-runtime:
https://github.com/NVIDIA/nvidia-container-runtime#docker-engine-setup
add to /etc/docker/daemon.json
{
"runtimes": {
"nvidia": {
"path": "/usr/bin/nvidia-container-runtime",
"runtimeArgs": []
}
}
}

docker-compose:
runtime: nvidia
environment:

NVIDIA_VISIBLE_DEVICES=all

This is not working for me, still getting the Unsupported config option for services.myservice: 'runtime' when trying to run docker-compose up

any ideas?

uderik · 2019-08-27T17:38:40Z

This is not working for me, still getting the Unsupported config option for services.myservice: 'runtime' when trying to run docker-compose up

any ideas?

after modify /etc/docker/daemon.json, restart docker service
systemctl restart docker
use Compose format 2.3 and add runtime: nvidia to your GPU service. Docker Compose must be version 1.19.0 or higher.
docker-compose file:
version: '2.3'

services:
nvsmi:
image: ubuntu:16.04
runtime: nvidia
environment:
- NVIDIA_VISIBLE_DEVICES=all
command: nvidia-smi

Kwull · 2019-08-27T17:52:02Z

@cheperuiz, you can set nvidia as default runtime in daemon.json and will not be dependent on docker-compose. But all you docker containers will use nvidia runtime - I have no issues so far.
{ "default-runtime": "nvidia", "runtimes": { "nvidia": { "path": "/usr/bin/nvidia-container-runtime", "runtimeArgs": [] } }, }

cheperuiz · 2019-08-27T18:45:12Z

Ah! thank you @Kwull , i missed that default-runtime part... Everything working now :)

johncolby · 2019-08-28T07:17:17Z

@uderik, runtime is no longer present in the current 3.7 compose file format schema, nor in the pending 3.8 version that should eventually align with Docker 19.03: https://github.com/docker/compose/blob/5e587d574a94e011b029c2fb491fb0f4bdeef71c/compose/config/config_schema_v3.8.json

andyneff · 2019-08-28T13:58:10Z

@johncolby runtime has never been a 3.x flag. It's only present in the 2.x track, (2.3 and 2.4).

cheperuiz · 2019-08-28T14:42:15Z

Yeah, I know, and even though my docker-compose.yml file includes the version: '2.3' (which have worked in the past) it seems to be ignored by the latest versions...
For future projects, what would be the correct way to enable/disable access to the GPU? just making it default + env variables? or will there be support for the --gpus flag?

Daniel451 · 2019-08-30T14:39:09Z

@johncolby what is the replacement for runtime in 3.X?

johncolby · 2019-08-30T15:37:12Z

@Daniel451 I've just been following along peripherally, but it looks like it will be under the generic_resources key, something like:

services:
  my_app:
    deploy:
      resources:
        reservations:
          generic_resources:
            - discrete_resource_spec:
                kind: 'gpu'
                value: 2

(from https://github.com/docker/cli/blob/9a39a1/cli/compose/loader/full-example.yml#L71-L74)
Design document here: https://github.com/docker/swarmkit/blob/master/design/generic_resources.md

Here is the compose issue regarding compose 3.8 schema support, which is already merged in: #6530

On the daemon side the gpu capability can get registered by including it in the daemon.json or dockerd CLI (like the previous hard-coded runtime workaround), something like

/usr/bin/dockerd --node-generic-resource gpu=2

which then gets registered by hooking into the NVIDIA docker utility:
https://github.com/moby/moby/blob/09d0f9/daemon/nvidia_linux.go

It looks like the machinery is basically in place, probably just needs to get documented...

chongyi-zheng · 2019-09-01T11:14:22Z

Any update?

statikkkkk · 2019-09-05T20:57:51Z

Also waiting on updates, using bash with docker run --gpus until the official fix...

celbirlik · 2019-09-09T08:07:37Z

Waiting for updates asw ell.

vk1z · 2021-02-09T03:11:01Z

To fix install the nvidia-container-toolkit(https://docs.nvidia.com/datacenter/cloud-native/container-toolkit/install-guide.html#setting-up-nvidia-container-toolkit)

docker-compose --version

docker-compose version 1.28.2, build 6763035

Compose file: services:
docker-compose-now-supports-device-requests:
image: nvidia/cuda:11.0-base
command: nvidia-smi
deploy:
resources:
reservations:
devices:
- capabilities:
- gpu
docker-compose up
==
docker-compose up
Building with native build. Learn about native build in Compose here: https://docs.docker.com/go/compose-native-build/
Removing vkurien_docker-compose-now-supports-device-requests_1
Recreating 30ae4cbfb9c1_vkurien_docker-compose-now-supports-device-requests_1 ...
Attaching to vkurien_docker-compose-now-supports-device-requests_1
docker-compose-now-supports-device-requests_1 | Tue Feb 9 03:00:36 2021
docker-compose-now-supports-device-requests_1 | +-----------------------------------------------------------------------------+
docker-compose-now-supports-device-requests_1 | | NVIDIA-SMI 460.39 Driver Version: 460.39 CUDA Version: 11.2 |
docker-compose-now-supports-device-requests_1 | |-------------------------------+----------------------+----------------------+
docker-compose-now-supports-device-requests_1 | | GPU Name Persistence-M| Bus-Id Disp.A | Volatile Uncorr. ECC |
docker-compose-now-supports-device-requests_1 | | Fan Temp Perf Pwr:Usage/Cap| Memory-Usage | GPU-Util Compute M. |
docker-compose-now-supports-device-requests_1 | | | | MIG M. |
docker-compose-now-supports-device-requests_1 | |===============================+======================+======================|
docker-compose-now-supports-device-requests_1 | | 0 GeForce GTX 107... Off | 00000000:07:00.0 On | N/A |
docker-compose-now-supports-device-requests_1 | | 0% 58C P8 19W / 180W | 500MiB / 8111MiB | 0% Default |
docker-compose-now-supports-device-requests_1 | | | | N/A |
docker-compose-now-supports-device-requests_1 | +-------------------------------+----------------------+----------------------+
docker-compose-now-supports-device-requests_1 |
docker-compose-now-supports-device-requests_1 | +-----------------------------------------------------------------------------+
docker-compose-now-supports-device-requests_1 | | Processes: |
docker-compose-now-supports-device-requests_1 | | GPU GI CI PID Type Process name GPU Memory |
docker-compose-now-supports-device-requests_1 | | ID ID Usage |
docker-compose-now-supports-device-requests_1 | |=============================================================================|
docker-compose-now-supports-device-requests_1 | +-----------------------------------------------------------------------------+
vkurien_docker-compose-now-supports-device-requests_1 exited with code 0

Motophan · 2021-02-09T03:20:09Z

To fix install the nvidia-container-toolkit(https://docs.nvidia.com/datacenter/cloud-native/container-toolkit/install-guide.html#setting-up-nvidia-container-toolkit)

docker-compose --version

docker-compose version 1.28.2, build 6763035

Compose file: services:
docker-compose-now-supports-device-requests:
image: nvidia/cuda:11.0-base
command: nvidia-smi
deploy:
resources:
reservations:
devices:

capabilities:

gpu

docker-compose up
docker-compose up
Building with native build. Learn about native build in Compose here: https://docs.docker.com/go/compose-native-build/
Removing vkurien_docker-compose-now-supports-device-requests_1
Recreating 30ae4cbfb9c1_vkurien_docker-compose-now-supports-device-requests_1 ...
Attaching to vkurien_docker-compose-now-supports-device-requests_1
docker-compose-now-supports-device-requests_1 | Tue Feb 9 03:00:36 2021
docker-compose-now-supports-device-requests_1 | +-----------------------------------------------------------------------------+
docker-compose-now-supports-device-requests_1 | | NVIDIA-SMI 460.39 Driver Version: 460.39 CUDA Version: 11.2 |
docker-compose-now-supports-device-requests_1 | |-------------------------------+----------------------+----------------------+
docker-compose-now-supports-device-requests_1 | | GPU Name Persistence-M| Bus-Id Disp.A | Volatile Uncorr. ECC |
docker-compose-now-supports-device-requests_1 | | Fan Temp Perf Pwr:Usage/Cap| Memory-Usage | GPU-Util Compute M. |
docker-compose-now-supports-device-requests_1 | | | | MIG M. |
docker-compose-now-supports-device-requests_1 | |===============================+======================+======================|
docker-compose-now-supports-device-requests_1 | | 0 GeForce GTX 107... Off | 00000000:07:00.0 On | N/A |
docker-compose-now-supports-device-requests_1 | | 0% 58C P8 19W / 180W | 500MiB / 8111MiB | 0% Default |
docker-compose-now-supports-device-requests_1 | | | | N/A |
docker-compose-now-supports-device-requests_1 | +-------------------------------+----------------------+----------------------+
docker-compose-now-supports-device-requests_1 |
docker-compose-now-supports-device-requests_1 | +-----------------------------------------------------------------------------+
docker-compose-now-supports-device-requests_1 | | Processes: |
docker-compose-now-supports-device-requests_1 | | GPU GI CI PID Type Process name GPU Memory |
docker-compose-now-supports-device-requests_1 | | ID ID Usage |
docker-compose-now-supports-device-requests_1 | |=============================================================================|
docker-compose-now-supports-device-requests_1 | +-----------------------------------------------------------------------------+
vkurien_docker-compose-now-supports-device-requests_1 exited with code 0

That won't actually work, I know it looks like it will work but it will not work. Tested with a p2000 do you need logs?

vk1z · 2021-02-09T03:27:31Z

@Motophan : Define "will not work", it did just work on my machine (ubuntu 18.04, gtx 1070), a moment ago if you take a closer look at what I attached. Try this command for instance:
sudo docker run --rm --gpus all nvidia/cuda:11.0-base nvidia-smi

Tell me what you get after installing nvidia container toolkit and restarting docker daemon.

estimadarocha · 2021-02-09T08:10:39Z

@vk1z so as far as i understand from your statements we still need to install nvidia-container-toolkit?

i am running:

Docker version 20.10.3, build 48d30b5
docker-compose version 1.28.2, build 6763035

Update:

After installing nvidia-container-toolkit i can run nvidia/cuda docker and run nvidia-smi.

But...

When trying plex as @Motophan said i can't have access gpus

services:
plex:
deploy:
resources:
reservations:
devices:
- capabilities:
- gpu

and if i install portainer and look at i can't see GPU line in container details as mentioned here portainer/portainer#4791 (comment) by @xAt0mZ

vk1z · 2021-02-09T12:26:50Z

@estimadarocha : I am afraid that I don't know about portainer. But I do have some questions for you:

Your understanding on whether we have to run nvidia-container-toolkit is correct.
Did you try the compose file that I had set up earlier (with the right nvidia/cuda image?
Did sudo docker run --rm --gpus all nvidia/cuda:11.0-base nvidia-smi work?
Did nvidia-smi run in the container and produce correct output using the compose file?

v

xAt0mZ · 2021-02-09T13:51:21Z

@vk1z Portainer is a GUI to manage docker/kubernetes endpoints (clusters or standalone) to lower the CLI learning hassle

@estimadarocha it's a pull request not merged nor released yet. But it's doing basically the same as using the --gpus CLI option. So if it's not working on your env with CLI, it will not work with Portainer either

estimadarocha · 2021-02-09T14:13:47Z

@vk1z

yes
yes
yes
yes

@xAt0mZ i tought the portainer implementation is finished.

vk1z · 2021-02-09T14:17:51Z

@estimadarocha : Thanks for confirming. Therefore it seems to me that from the docker-compose point of view, we are good.

Xefir · 2021-02-09T14:19:51Z

@estimadarocha What image do you use for Plex ?

The image has to be optimised for this kind of work. For example, the official image of Plex is NOT compatible with GPUs, regardless of flags you pass on Docker or docker-compose file. Guys from linuxserver has done some work to be able to use the GPU, so try their image instead.

fmoledina · 2021-02-09T14:24:50Z

@Xefir I use the official Plex image for GPU transcoding using the nvidia-docker2 runtime. Are you saying that using the --gpus flag wouldn't work?

euri10 · 2021-02-09T16:22:22Z

I use the official plex image and run the nvidia container just fine with the latest docker-compose, hw encoding works fine, no need to use the linuxserver image, you just have to remember to pass the env vars they advertise

…

On Tue, Feb 9, 2021 at 3:25 PM Faisal Moledina ***@***.***> wrote: @Xefir <https://github.com/Xefir> I use the official Plex image for GPU transcoding using the nvidia-docker2 runtime. Are you saying that using the --gpus flag wouldn't work? — You are receiving this because you are subscribed to this thread. Reply to this email directly, view it on GitHub <#6691 (comment)>, or unsubscribe <https://github.com/notifications/unsubscribe-auth/AAINSPW5MJ6WUHPNQITRXGLS6FAUXANCNFSM4HL45G6Q> .

-- benoit barthelet http://pgp.mit.edu/pks/lookup?op=get&search=0xF150E01A72F6D2EE

estimadarocha · 2021-02-09T21:31:33Z

@Xefir I use linuxserver

The question here is related to docker compose...
Or better what's the needed options that we need to have present on compose.

Is only these ones:

services:
plex:
deploy:
resources:
reservations:
devices:

capabilities:
gpu

This equal to gpus -all on direct command line?

Is this enought?

Thanks

C84186 · 2021-02-10T02:41:16Z

Problem Statement - How to enable HW accellerated transcoding for mediaserver (Jellyfin, etc)

I'm interesting in running a similar media server (jellyfin) w/ HW encoding via docker compose.

I would rather not have to build a GPU compatible image from scratch based off the base images, and instead continue using the standard jellyfin images.

Outdated - I've answered my question below

It's encouraging to see people suggesting that you don't need a runtime specific image to make this work.

I'm not clear on what I do need to define for my media service in my compose spec in order to have it work correctly.

Could anyone please provide a minimum working example of a mediaserver (I'd prefer jellyfin, but beggars can't be choosers) leveraging this runtime?

Or does #6691 (comment) mean that this runtime isn't even strictly necessary to utilize GPU in your compose services? If so, very exciting.

docker-compose.yml that doesn't work

  jellyfin:
   image: jellyfin/jellyfin
   runtime: nvidia
   environment:
     NVIDIA_VISIBLE_DEVICES: all
   deploy:
      resources:
        reservations:
          devices:
            - capabilities:
              - gpu

or do I also need to set runtime ?

Update

The above was not sufficient - When I needed accellerated transcoding, I'd lose the stream.
The jellyfin logs told me ffmpeg was failing:

jellyfin logs

[2021-02-10 14:08:21.598 +11:00] [INF] [34] MediaBrowser.Api.Playback.Hls.DynamicHlsService: /usr/lib/jellyfin-ffmpeg/ffmpeg -c:v h264_cuvid -resize 720x404 -i file:"/path/to/file" -map_metadata -1 -map_chapters -1 -threads 0 -map 0:0 -map 0:1 -map -0:s -codec:v:0 h264_nvenc -pix_fmt yuv420p -preset default -b:v 1878633 -maxrate 1878633 -bufsize 3757266 -profile:v high  -g 72 -keyint_min 72 -sc_threshold 0 -start_at_zero -vsync -1 -codec:a:0 libmp3lame -ac 2 -ab 121367  -copyts -avoid_negative_ts disabled -f hls -max_delay 5000000 -hls_time 3 -individual_header_trailer 0 -hls_segment_type mpegts -start_number 0 -hls_segment_filename "/config/data/transcodes/8e249c1ea338e1b1054a42cbe728d068%d.ts" -hls_playlist_type vod -hls_list_size 0 -y "/config/data/transcodes/8e249c1ea338e1b1054a42cbe728d068.m3u8"
[2021-02-10 14:08:21.636 +11:00] [ERR] [25] MediaBrowser.Api.Playback.Hls.DynamicHlsService: FFMpeg exited with code 1
[2021-02-10 14:08:21.710 +11:00] [WRN] [32] MediaBrowser.Api.Playback.Hls.DynamicHlsService: cannot serve "/config/data/transcodes/8e249c1ea338e1b1054a42cbe728d0680.ts" as transcoding quit before we got there

the ffmpeg logs gave me "Operation not permitted":

FFMPEG logs

[h264_cuvid @ 0x55eb835ca4c0] Cannot load libnvcuvid.so.1
[h264_cuvid @ 0x55eb835ca4c0] Failed loading nvcuvid.
Stream mapping:
  Stream #0:0 -> #0:0 (h264 (h264_cuvid) -> h264 (h264_nvenc))
  Stream #0:1 -> #0:1 (aac (native) -> mp3 (libmp3lame))
Error while opening decoder for input stream #0:0 : Operation not permitted

Fortunately, the fix was easy - this reddit thread gave the answer - the following works:

Working compose definition for hardware transcoding

# docker-compose.override.yml
# my volumes, ports, traefik, most of the "standard" jellyfin env is set elsewhere

  jellyfin:
   image: jellyfin/jellyfin
   runtime: nvidia
   environment:
     NVIDIA_VISIBLE_DEVICES: all
     NVIDIA_DRIVER_CAPABILITIES: all
   deploy:
      resources:
        reservations:
          devices:
            - capabilities:
              - gpu

To be clear - the above is actually from my docker-compose.override file, so isn't a full reprex for running this service.
You can easily substitute in your plex, emby, etc for this, however:

# docker-compose.override.yml

version: "2.4"
services:
  YOUR-SERVICE-NAME:
   runtime: nvidia
   environment:
     NVIDIA_VISIBLE_DEVICES: all
     NVIDIA_DRIVER_CAPABILITIES: all
   deploy:
      resources:
        reservations:
          devices:
            - capabilities:
              - gpu

I'm not actually clear on what parts of the above service defintion I actually need.

Also, I assume it's possible to set more fine-grained control on what driver capabilities your service needs, transcoding, machine learning acceleration etc, but I don't know that I care.

Update: Based off of #6691 (comment) , the following value should suffice:

NVIDIA_DRIVER_CAPABILITIES: 'compute,video,utility'

I think there's redundancy in the use of the runtime: + deploy settings, but hey, if it aint broke...

vk1z · 2021-02-10T04:01:09Z

@C84186: Thanks for your work. Frankly this points out a need for more compose "recipes". This thread is serving as a substitute for documentation, alas.

euri10 · 2021-02-10T06:00:26Z

here's my docker-compose for plex official image that uses hw encoding fine (just edited useless parts).

the only thing I can comment on is that without the 2 NVIDIA env variables (which happen to be mentioned in the linuxserver image doc) there was no hw encoding happening, hope this helps

version: "3.8"
services:
  plex:
    image: plexinc/pms-docker:1.21.3.4014-58bd20c02
    runtime: nvidia
    deploy:
      resources:
        reservations:
          devices:
            - capabilities:
              - gpu
    environment:
     - TZ=Europe/Paris
     - PLEX_CLAIM=claim-xxx
     - ADVERTISE_IP=https://xxx:443
     - NVIDIA_VISIBLE_DEVICES=all
     - NVIDIA_DRIVER_CAPABILITIES=compute,video,utility
    volumes:
      - /home/xxx/plex/config:/config
      - /home/xxx/plex/transcode/:/transcode
    ports:
      - 32400:32400
    networks:
      - traefik-local
    labels:
      - "traefik.enable=true"
      - "traefik.http.routers.plex.rule=Host(`xxx`)"
      - "traefik.http.routers.plex.entrypoints=websecure"
      - "traefik.http.routers.plex.tls.certresolver=myhttpchallenge"
      - "traefik.http.services.plex.loadbalancer.server.port=32400"
    restart: unless-stopped
networks:
  traefik-local:
    external: true

RyanHakurei · 2021-02-11T06:15:27Z

@C84186 I don't think you need runtime: nvidia and iirc the old Nvidia Runtime was deprecated in favor of Nvidia Container Toolkit anyway. I have that omitted and hardware transcoding is working for me:

version: "3.3"
services:
...
# Plex Media Server
    plex:
        restart: always
        container_name: Plex
        network_mode: host
        deploy:
           resources:
             reservations:
               devices:
                 - capabilities:
                   - gpu
        labels:
            - com.centurylinklabs.watchtower.enable=true
        environment:
            - PLEX_CLAIM=Lolno
            - PUID=1000
            - PGID=1000
            - VERSION=latest
            - NVIDIA_VISIBLE_DEVICES=all
            - NVIDIA_DRIVER_CAPABILITIE=all
        volumes:
            - '/mnt/SSD/Sandbox/Plex/Data:/config'
            - '/mnt/Media/Sandbox/Plex/Library:/data/:ro'
            - '/mnt/Media/Sandbox/Plex/Prerolls:/Prerolls:ro'
            - '/mnt/Media/Sandbox/Plex/Sync:/transcode'
            - '/mnt/Media/LetsEncrypt:/Keys:ro'
        image: linuxserver/plex

estimadarocha · 2021-02-11T12:14:07Z

i confirm what @ryaniskira said.

when nvidia start to deprecate runtime: nvidia in favor of --gpus all this is what leads to all this needed changes on docker compose and portainer.

so if we use the new options:
deploy:
resources:
reservations:
devices:
- capabilities:
- gpu

runtime:nvidia shouldn't be used

vk1z · 2021-02-11T15:43:20Z

I can't speak to Plex but I don't seem to need the environment variables NVIDIA_VISIBLE_DEVICES or NVIDIA_DRIVER_CAPABILITIES

RyanHakurei · 2021-02-11T16:23:02Z

@vk1z I seem to need it, I passed the GPU to my Boinc container without passing those variables and it did not detect my GPU, added them to Boinc's compose and suddenly it started to download tasks from GPUGrid.

vk1z · 2021-02-11T17:12:12Z

@ryaniskira : Weird. Doesn't engender confidence TBH. Will have to look at this more closely.

Atralb · 2021-02-15T08:40:19Z

@ryaniskira @estimadarocha You guys are ignorant of the current state of nvidia-docker and claiming something you heard as true without ever having verified it yourselves.

the old Nvidia Runtime was deprecated in favor of Nvidia Container Toolkit anyway

This is completely false, and even a misunderstanding of the different entities in the nvidia container stack and how they interact together.

You should maybe read the official documentation sometimes: https://docs.nvidia.com/datacenter/cloud-native/container-toolkit/arch-overview.html

As we can clearly see, runtime: nvidia is still there, and it is even precisely what is actually leveraged under the hood by the --gpus option.

The "nvidia runtime" is simply a piece of config in your daemon.json that asks to use the Nvidia container toolkit. The latter is absolutely not a replacement of the former, since they are both the same thing, just at different levels.

Motophan · 2021-02-15T10:26:47Z

That's great. How do we add it to our Plex/jellyfin containers @Atralb

…

On Sun, Feb 14, 2021, 11:40 PM Atralb ***@***.***> wrote: @ryaniskira <https://github.com/ryaniskira> @estimadarocha <https://github.com/estimadarocha> You guys are ignorant of the current state of nvidia-docker and claiming something you heard as true without ever having verified yourselves, actually not even understanding how the nvidia container stack works. the old Nvidia Runtime was deprecated in favor of Nvidia Container Toolkit anyway This is completely false, and even a misunderstanding of the different entities in the stack. You should maybe read the official documentation sometimes: https://docs.nvidia.com/datacenter/cloud-native/container-toolkit/arch-overview.html As we can clearly see, runtime: nvidia is always there, and it is even precisely what is actually leveraged under the hood with the --gpus option. The "nvidia runtime" is simply a piece of config in your daemon.json that asks to use the Nvidia container toolkit. — You are receiving this because you were mentioned. Reply to this email directly, view it on GitHub <#6691 (comment)>, or unsubscribe <https://github.com/notifications/unsubscribe-auth/AKEIUFGCLG7YUJ3MHJIELYLS7DMZFANCNFSM4HL45G6Q> .

estimadarocha · 2021-02-15T10:45:30Z

@Atralb picking my ignorancy I will try to get some of my free time to have a close look at the info you post. Thanks for the info.

Meanwhile can you point us the best approach to use?

Atralb · 2021-02-15T15:11:21Z

@Motophan @estimadarocha Never set up a jellyfin/plex container yet. But since the new compose spec (docker-compose > 1.28) includes a runtime parameter, I simply use runtime: nvidia in my compose.yaml files for a tensorflow cuda container and my trainings work perfectly.

RyanHakurei · 2021-02-15T17:47:07Z

@Atralb Woah woah woah that's a lot of worlds for "I am fucking wrong and am going to look like an idiot while trying to grandstand above others" as even your own citation states:

With Docker 19.03+, this is fine because Docker directly invokes nvidia-container-toolkit when you pass it the --gpus option instead of relying on the nvidia-container-runtime as a proxy.

So bam, deprecated and no longer needed AS PER YOUR OWN DOCUMENTATION that you so gleefully suggest that I read. Nvidia-container-runtime is no longer needed to proxy things as Docker can directly invoke Nvidia-container-toolkit now. There's also the Archwiki which recommends as much in the Docker page:

Starting from Docker version 19.03, NVIDIA GPUs are natively supported as Docker devices. NVIDIA Container Toolkit is the recommended way of running containers that leverage NVIDIA GPUs.

but I guess they're wrong too huh?

EDIT: Also if you actually, you know, read the thread you would see people having issues trying to invoke the deprecated Nvidia-container-runtime.

Motophan · 2021-02-15T20:32:45Z

That's great, but all we want is a doc update w/ a working example for passing a GPU to Plex or jellyfin.

…

On Mon, Feb 15, 2021, 8:47 AM Ryan . ***@***.***> wrote: @Atralb <https://github.com/Atralb> Woah woah woah that's a lot of worlds for "I am fucking wrong and am going to look like an idiot while trying to grandstand above others" as even *your own citation* states: With Docker 19.03+, this is fine because Docker directly invokes nvidia-container-toolkit when you pass it the --gpus option instead of relying on the nvidia-container-runtime as a proxy. So bam, deprecated and no longer needed *AS PER YOUR OWN DOCUMENTATION* that you so gleefully suggest that I read. Nvidia-container-runtime is no longer needed to proxy things as Docker can directly invoke Nvidia-container-toolkit now. There's also the Archwiki which recommends as much in the Docker page <https://wiki.archlinux.org/index.php/Docker#Run_GPU_accelerated_Docker_containers_with_NVIDIA_GPUs> : Starting from Docker version 19.03, NVIDIA GPUs are natively supported as Docker devices. NVIDIA Container Toolkit is the recommended way of running containers that leverage NVIDIA GPUs. but I guess they're wrong too huh? EDIT: Also if you actually, you know, read the thread you would see people having issues trying to invoke the deprecated Nvidia-container-runtime <#6691 (comment)>. — You are receiving this because you were mentioned. Reply to this email directly, view it on GitHub <#6691 (comment)>, or unsubscribe <https://github.com/notifications/unsubscribe-auth/AKEIUFHKMNAHEQB5YAHNRHDS7FM3RANCNFSM4HL45G6Q> .

Atralb · 2021-02-16T08:18:28Z

@ryaniskira Lol, as we all know, all caps is the best argument indeed :).

You're saying a lot of words, but nowhere did you provide actual source that the "runtime" is deprecated. That's just your sole interpretation of what you're reading.

EDIT: Also if you actually, you know, read the thread you would see people having issues trying to invoke the deprecated Nvidia-container-runtime.

Again, showcasing your ignorance of the history of this issue. What you're linking was during the era of compose file v3, where runtime didn't exist, and people simply didn't read the documentation just like you.

All these comments are void since the new compose spec reintroduced the keyword, and that's exactly the issue here. People are recommending methods which are obsolete, which were developed by the community to fill this gap of runtime in v3, but have no point existing now. Which is precisely why I intervened.

But sure, getting all worked up cause you're wrong will surely make you right :).

chris-crone · 2021-02-16T09:34:00Z

Hi all, this thread is getting a bit heated. Let's remember there's a real person on the other side of each comment.

We've updated the official docs with instructions for how to get GPU support working with Compose: https://docs.docker.com/compose/gpu-support/

I've noticed that the prerequisites link there is broken (we'll fix it soon!), you'll need to follow these instructions: https://docs.docker.com/config/containers/resource_constraints/#gpu

I'll be locking this thread, please open a new issue if you've followed those instructions and run into an issue

collabnix changed the title ~~Add DeviceRequests to HostConfig to support NVIDIA GPUs under Docker Compose~~ Support for NVIDIA GPUs under Docker Compose May 9, 2019

jcsirot added the kind/enhancement label May 13, 2019

qhaas mentioned this issue Jul 24, 2019

Support runtime in v3.x compose files #6239

Closed

qhaas mentioned this issue Jul 24, 2019

nvidia-container-runtime-3.0.0-1.x86_64.rpm not compatible NVIDIA/nvidia-container-runtime#68

Closed

qhaas mentioned this issue Jul 31, 2019

Example of nvidia-docker2 with docker-compose NVIDIA/nvidia-docker#568

Closed

nemchik mentioned this issue Aug 4, 2019

nvidia-docker2 support GhostWriters/DockSTARTer#781

Closed

aronhelser mentioned this issue Aug 28, 2019

WIP: Some docker fixes Kitware/hpccloud-services#3

Merged

4 tasks

edurenye mentioned this issue Feb 13, 2021

Add docker compose GPU support iot-salzburg/gpu-jupyter#42

Closed

docker locked as too heated and limited conversation to collaborators Feb 16, 2021

Support for NVIDIA GPUs under Docker Compose #6691

Support for NVIDIA GPUs under Docker Compose #6691

Comments

collabnix commented May 9, 2019

qhaas commented Jul 24, 2019 • edited Loading

michaelnordmeyer commented Jul 24, 2019

akiross commented Jul 24, 2019

kiendang commented Jul 28, 2019

VanDavv commented Aug 9, 2019

iedmrc commented Aug 13, 2019

Daniel451 commented Aug 16, 2019

andyneff commented Aug 16, 2019

uderik commented Aug 27, 2019

Kwull commented Aug 27, 2019

uderik commented Aug 27, 2019

cheperuiz commented Aug 27, 2019

uderik commented Aug 27, 2019 • edited Loading

Kwull commented Aug 27, 2019

cheperuiz commented Aug 27, 2019

johncolby commented Aug 28, 2019

andyneff commented Aug 28, 2019

cheperuiz commented Aug 28, 2019

Daniel451 commented Aug 30, 2019

johncolby commented Aug 30, 2019

chongyi-zheng commented Sep 1, 2019

statikkkkk commented Sep 5, 2019

celbirlik commented Sep 9, 2019

vk1z commented Feb 9, 2021

Motophan commented Feb 9, 2021

docker-compose up

vk1z commented Feb 9, 2021

estimadarocha commented Feb 9, 2021 • edited Loading

vk1z commented Feb 9, 2021 • edited Loading

xAt0mZ commented Feb 9, 2021

estimadarocha commented Feb 9, 2021

vk1z commented Feb 9, 2021

Xefir commented Feb 9, 2021

fmoledina commented Feb 9, 2021

euri10 commented Feb 9, 2021 via email

estimadarocha commented Feb 9, 2021

C84186 commented Feb 10, 2021 • edited Loading

Problem Statement - How to enable HW accellerated transcoding for mediaserver (Jellyfin, etc)

Update

Working compose definition for hardware transcoding

vk1z commented Feb 10, 2021

euri10 commented Feb 10, 2021 • edited Loading

RyanHakurei commented Feb 11, 2021

estimadarocha commented Feb 11, 2021

vk1z commented Feb 11, 2021

RyanHakurei commented Feb 11, 2021

vk1z commented Feb 11, 2021

Atralb commented Feb 15, 2021 • edited Loading

Motophan commented Feb 15, 2021 via email

estimadarocha commented Feb 15, 2021

Atralb commented Feb 15, 2021 • edited Loading

RyanHakurei commented Feb 15, 2021

Motophan commented Feb 15, 2021 via email

Atralb commented Feb 16, 2021 • edited Loading

chris-crone commented Feb 16, 2021

qhaas commented Jul 24, 2019 •

edited

Loading

uderik commented Aug 27, 2019 •

edited

Loading

estimadarocha commented Feb 9, 2021 •

edited

Loading

vk1z commented Feb 9, 2021 •

edited

Loading

C84186 commented Feb 10, 2021 •

edited

Loading

euri10 commented Feb 10, 2021 •

edited

Loading

Atralb commented Feb 15, 2021 •

edited

Loading

Atralb commented Feb 15, 2021 •

edited

Loading

Atralb commented Feb 16, 2021 •

edited

Loading