Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

ptxas fatal : Value 'sm_86' is not defined for option 'gpu-name' #6187

Closed
LucienBrule opened this issue Nov 2, 2020 · 13 comments
Closed

ptxas fatal : Value 'sm_86' is not defined for option 'gpu-name' #6187

LucienBrule opened this issue Nov 2, 2020 · 13 comments
Labels

Comments

@LucienBrule
Copy link

Environment

Windows Version:
    Microsoft Windows [Version 10.0.20246.1]

cat /etc/lsb-release:

    DISTRIB_ID=Ubuntu
    DISTRIB_RELEASE=20.04
    DISTRIB_CODENAME=focal
    DISTRIB_DESCRIPTION="Ubuntu 20.04.1 LTS"

Kernel:
    Linux version 4.19.128-microsoft-standard (oe-user@oe-host) (gcc version 8.2.0 (GCC)) 

wsl --list -v 
    * Ubuntu    Running         2

Driver Info:
    Nvidia Driver 460.20
    Card: RTX 3080

Steps to reproduce

  1. Install CUDA 11.1 Follow guide https://docs.nvidia.com/cuda/wsl-user-guide/index.html
  2. Install cuDNN v8.0.4 Follow guide https://docs.nvidia.com/deeplearning/cudnn/install-guide/index.html
  3. Setup Conda
$ conda create -n tf
$ conda activate tf
$ pip install tf-nightly-gpu
$ pip install jupyter
  1. Start jupyter server, copy paste blocks from this notebook https://www.tensorflow.org/hub/tutorials/image_feature_vector
  2. Go to "Train the network"

Expected behavior

Afaik I'm running the latest if not late enough builds of everything so, no warnings, just standard info.

Actual behavior

Massive console spam of

[timestamp] : W tensorflow/stream_executor/gpu/asm_compiler.cc:235] Your CUDA software stack is old. We fallback to the NVIDIA driver for some compilation. Update your CUDA version to get the best performance. The ptxas error was: ptxas fatal   : Value 'sm_86' is not defined for option 'gpu-name'

It still worked fine but seemed worth it to report.

Question

  • What in the CUDA stack makes it too old?
  • Should I downgrade CUDA in WSL to match something else?
@therealkenc therealkenc added the GPU label Nov 2, 2020
@cshang2017
Copy link

Has the same issue.

@gdh1995
Copy link

gdh1995 commented Nov 25, 2020

It's because tensorflow lacks supports for new GPUs, but not an error of WSL .

I find some related issues in the tensorflow repo, like tensorflow/tensorflow#43718

@Harsh188
Copy link

Harsh188 commented Nov 29, 2020

Hi, the latest version of TensorFlow supports CUDA 11 for the new GPUs. However, I'm getting a similar issue. Check out the issue I raised on the tensorflow repo.

@LucienBrule
Copy link
Author

Thank you all for your input, will look into it further

@WillLiGitHub
Copy link

same issue in Geforce rtx 3080

$ failed to create cublas handle: CUBLAS_STATUS_NOT_INITIALIZED
$ W tensorflow/stream_executor/gpu/asm_compiler.cc:235] Your CUDA software stack is old. We fallback to the NVIDIA driver for some compilation. Update your CUDA version to get the best performance. The ptxas error was: ptxas fatal : Value 'sm_86' is not defined for option 'gpu-name'

set system env can fix my problem:
os.environ["CUDA_VISIBLE_DEVICES"] = "1"

@Harsh188
Copy link

Hello,

I've found a software that makes TF 2.30 compatible with cudav11.1. It works only for Ubuntu 20.04 at the moment. If you are interested, then check out the article Install TensorFlow & PyTorch for the RTX 3090, 3080, 3070 by lambda stack.

@usr-ein
Copy link

usr-ein commented Mar 30, 2021

same issue in Geforce rtx 3080

$ failed to create cublas handle: CUBLAS_STATUS_NOT_INITIALIZED
$ W tensorflow/stream_executor/gpu/asm_compiler.cc:235] Your CUDA software stack is old. We fallback to the NVIDIA driver for some compilation. Update your CUDA version to get the best performance. The ptxas error was: ptxas fatal : Value 'sm_86' is not defined for option 'gpu-name'

set system env can fix my problem:
os.environ["CUDA_VISIBLE_DEVICES"] = "1"

I tried this solution, but all it does is stop using the GPU:

E tensorflow/stream_executor/cuda/cuda_driver.cc:328] failed call to cuInit: CUDA_ERROR_NO_DEVICE: 
no CUDA-capable device is detected

@westlake-moonlight
Copy link

I met this issue yesterday. It turns out that this problem is caused by the old "ptxas", which doesn't support RTX3090, 3080 GPUs.
On my computer, there were 2 cudas: cuda-11.0 and cuda-11.1 in the directory usr/local. The TensorFlow was trying to use the cuda-11.0 somehow, and the old "ptxas" in the folder cuda-11.0/bin is causing the problem.

Solution: I renamed the cuda-11.0 to something else, so that the TensorFlow can only find and use the "ptxas" in the cuda-11.1. After that, the problem is fixed: the warning was gone and everything is OK.

Environment: Ubuntu 20.04, TensorFlow 2.4, Cuda 11.1.1, CuDNN 8.05, Nvidia Driver Version: 455.32.

@johndpope
Copy link

johndpope commented Jul 15, 2021

I'm successfully running rtx 3090 with latest beta 470 driver + cuda 11.4 but need to use gcc9(stay away from 10.3 / system update)
I recommend using timeshift to save state when everything is working.
This morning I had to roll back a system update that made the graphics card driver downgrade to 460. crazy.

UPDATE - Sorry I'm not using WSL. just native dual boot.

@alon1samuel
Copy link

alon1samuel commented Jul 30, 2021

I'm also getting the same problem
Environment: Ubuntu 20.04, TensorFlow 2.4, Cuda 11.4.0, CuDNN 8.2.2.26, Nvidia Driver Version: 470.57.02, nvcc - 10.1

2021-07-30 18:18:03.339065: W tensorflow/stream_executor/gpu/asm_compiler.cc:235] Your CUDA software stack is old. We fallback to the NVIDIA driver for some compilation. Update your CUDA version to get the best performance. The ptxas error was: ptxas fatal : Value 'sm_86' is not defined for option 'gpu-name'

@talhaanwarch
Copy link

usr/local

renaming did not helped me

Copy link
Contributor

This issue has been automatically closed since it has not had any activity for the past year. If you're still experiencing this issue please re-file this as a new issue or feature request.

Thank you!

@xman3911
Copy link

I have the same issue when use Tensorflow2.4.1 with CUDA11.0. But it is solved when I use Tesorflow2.5.0 with CUDA11.2. So the reason maybe is the CUDA's version is too old.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Projects
None yet
Development

No branches or pull requests