
[WIP] CUDA 10.0 and tensorflow 1.14 for docker install #682

Closed · wants to merge 3 commits

Conversation

dustindorroh (Contributor)

No description provided.

Dockerfile (outdated; resolved)
```diff
@@ -15,4 +15,4 @@ services:
     environment:
       NVIDIA_VISIBLE_DEVICES: all
       NVIDIA_DRIVER_CAPABILITIES: compute,utility
-      NVIDIA_REQUIRE_CUDA: "cuda>=9.0"
+      NVIDIA_REQUIRE_CUDA: "cuda>=10.0 brand=tesla,driver>=384,driver<385 brand=tesla,driver>=410,driver<411"
```
nmanovic (Contributor):

Does the string work for "Tesla" only?

dustindorroh (Contributor Author):

Good question. It's not clear whether you need a Tesla GPU to use CUDA 10.0: https://docs.nvidia.com/cuda/archive/10.0/cuda-toolkit-release-notes/index.html

But it may be more of a requirement of running nvidia-docker:
https://github.com/NVIDIA/nvidia-docker/wiki/CUDA

I got this line from: https://gitlab.com/nvidia/container-images/cuda/blob/master/dist/ubuntu16.04/10.0/base/Dockerfile

```dockerfile
#
# cuda 10.0 base - https://gitlab.com/nvidia/cuda/blob/ubuntu16.04/10.0/base/Dockerfile
# cuda 10.0 runtime - https://gitlab.com/nvidia/cuda/blob/ubuntu16.04/10.0/runtime/Dockerfile
# cudnn7 - https://gitlab.com/nvidia/cuda/blob/ubuntu16.04/10.0/runtime/cudnn7/Dockerfile
```
nmanovic (Contributor):

I see that you combined all these Dockerfiles together and put the instructions here. It will be difficult for us to support them, because NVIDIA can change the original files in the future and that could break our code.

@azhavoro, I'm thinking about a separate container that can execute some registered functions (e.g. TF annotation) using https://docs.python.org/3/library/xmlrpc.html. Could you please recommend something here?
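As a rough illustration of that idea, such a container could expose its registered functions over XML-RPC using only the standard library. This is a minimal sketch; the function name `run_tf_annotation`, the port, and the return value are assumptions for illustration, not CVAT's actual interface:

```python
# Hypothetical sketch of a registered-function container using the Python
# standard library's xmlrpc module. Names, port, and return shape are
# illustrative assumptions, not part of CVAT.
from xmlrpc.server import SimpleXMLRPCServer

def run_tf_annotation(task_id):
    # Placeholder body: the real function would run the TF model over the
    # task's frames and store the resulting annotations.
    return {"task_id": task_id, "status": "done"}

server = SimpleXMLRPCServer(("0.0.0.0", 8090), allow_none=True)
server.register_function(run_tf_annotation)
server.serve_forever()
```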

dustindorroh (Contributor Author) commented Sep 3, 2019:

@nmanovic I agree. I was following the previous example: https://github.com/opencv/cvat/blob/develop/components/cuda/install.sh
It looked to me like the base, runtime, and cudnn7 images were combined into one file. I was unsure whether this was intentional on your part; maybe having fewer Docker layers was desired.
If there is interest in this, I can write it that way. It may help with composing different CUDA versions.

dustindorroh (Contributor Author) left a comment:

I started this pull request not necessarily just to get it merged, but to start a conversation about how you were thinking about support for new versions. My main motivation was to get a full round trip of tf_annotation, training with the TensorFlow Object Detection API, and manual adjustment.

Also, as of CUDA 9 and above, multiple CUDA versions are backwards compatible:
https://docs.nvidia.com/deploy/cuda-compatibility/index.html

dustindorroh (Contributor Author):
If Codacy flags my single-quote use with variables, this was intended, as I'm appending statements containing variables to ~/.bashrc.
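For context, the pattern in question looks roughly like this (the paths are illustrative, not the exact lines from the script):

```sh
# Single quotes are deliberate: they stop the shell from expanding $PATH and
# $LD_LIBRARY_PATH at install time, so expansion happens later, whenever
# ~/.bashrc is sourced.
echo 'export PATH=/usr/local/cuda/bin:$PATH' >> ~/.bashrc
echo 'export LD_LIBRARY_PATH=/usr/local/cuda/lib64:$LD_LIBRARY_PATH' >> ~/.bashrc
```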

nmanovic (Contributor) commented Sep 5, 2019:

@dustindorroh, internally we discussed a separate container for CUDA functionality a long time ago. That way it will be easy to add similar features in the future or to modify the current one. It is a good approach from many points of view.

Also, as a container, we can add a "training" part. We just need to invent an interface using https://docs.python.org/3/library/xmlrpc.html. Do you think you have time to help us with the feature? It is a long road: I believe discussion and implementation can take a couple of months, but the result should be promising.
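On the CVAT side, calling such a registered function would then be a short client call with xmlrpc.client. The hostname `tf-annotation` and port 8090 continue the assumptions from the server sketch earlier in the thread:

```python
import xmlrpc.client

# Hypothetical client call matching the earlier server sketch; the container
# hostname and port are assumptions, not an actual CVAT endpoint.
proxy = xmlrpc.client.ServerProxy("http://tf-annotation:8090/")
print(proxy.run_tf_annotation(42))
```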

nmanovic changed the title from "CUDA 10.0 and tensorflow 1.14 for docker install" to "[WIP] CUDA 10.0 and tensorflow 1.14 for docker install" on Sep 5, 2019
nmanovic (Contributor):

@dustindorroh, we made our next release and now have more time to discuss the feature. Could you please help us move TF annotation into a separate container? In the separate container we can have any version of CUDA (the versions of CUDA and TF can be build arguments). I hope you can come up with a proposal for how to do that based on the discussion in this thread. Don't hesitate to contact me directly if you have any questions. Do you think you can contribute the feature?
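A minimal sketch of what "versions as build arguments" could look like; the image tags and package versions below are examples consistent with the PR (CUDA 10.0, TF 1.14), not a tested configuration:

```dockerfile
# Hypothetical sketch: CUDA and TF versions passed as build arguments, e.g.
#   docker build --build-arg CUDA_VERSION=10.0 --build-arg TF_VERSION=1.14.0 .
ARG CUDA_VERSION=10.0
FROM nvidia/cuda:${CUDA_VERSION}-cudnn7-runtime-ubuntu16.04

ARG TF_VERSION=1.14.0
RUN apt-get update && \
    apt-get install -y --no-install-recommends python3-pip && \
    rm -rf /var/lib/apt/lists/* && \
    pip3 install tensorflow-gpu==${TF_VERSION}
```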

nmanovic (Contributor):

@dustindorroh, once again thanks for the contribution. We are not going to accept the PR because it can lead to regressions. In the future we will move CUDA functionality into a separate container.
