
ggml-cuda.cu:3211: ERROR: CUDA kernel vec_dot_q5_K_q8_1_impl_vmmq #5686

Closed

DanCard opened this issue Feb 23, 2024 · Discussed in #5685 · 2 comments


DanCard commented Feb 23, 2024

Discussed in #5685

Originally posted by DanCard February 23, 2024
ggml-cuda.cu:3211: ERROR: CUDA kernel vec_dot_q5_K_q8_1_impl_vmmq has no device code compatible with CUDA arch 520. ggml-cuda.cu was compiled for: 520
This worked yesterday. I did a git pull, make clean, and make, and then got this error today.
GPU: NVIDIA RTX 3090
System: Debian testing
Command line:
~/github/llama.cpp/main -m ~/models/miqu-1-70b.q5_K_M.gguf -c 0 -i --color -t 16 --n-gpu-layers 24 --temp 0.8 -p "bob"

I reverted the previous two commits and the issue went away.
~/github/llama.cpp$ git reset --hard HEAD~2
HEAD is now at 334f76f sync : ggml
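
For context on why this fires: the quantized mat-mul kernel named in the error relies on the DP4A packed int8 dot-product instruction, which only exists from compute capability 6.1 ("610") upward, so ggml-cuda.cu compiles it behind an architecture guard. "Compiled for: 520" is the suspicious part, because sm_52 is nvcc's default target when no -arch flag is passed at all, which fits a build where the arch flags were dropped by the offending commits. Below is a minimal, hypothetical sketch of that guard pattern, not the actual ggml-cuda.cu source; the file and kernel names (guard_sketch.cu, vec_dot_sketch) are made up for illustration.

// guard_sketch.cu — hypothetical sketch of the arch-guard pattern behind
// this class of error; NOT the real ggml-cuda.cu source.
#include <cstdio>
#include <cuda_runtime.h>

#define MIN_CC_DP4A 610  // __dp4a needs compute capability 6.1 or newer

__global__ void vec_dot_sketch(const int *a, const int *b, int *out) {
#if __CUDA_ARCH__ >= MIN_CC_DP4A
    // Fast path: dot product of four packed int8 lanes, sm_61+ only.
    *out = __dp4a(*a, *b, 0);
#else
    // Compiled only for an older arch (e.g. nvcc's default sm_52 -> "520"):
    // no usable device code exists for this kernel, so abort at run time.
    printf("ERROR: kernel has no device code compatible with CUDA arch %d\n",
           __CUDA_ARCH__);
    __trap();
#endif
}

int main() {
    int ha = 0x01010101, hb = 0x02020202, hout = 0;  // four int8 lanes each
    int *a, *b, *out;
    cudaMalloc(&a, sizeof(int)); cudaMalloc(&b, sizeof(int)); cudaMalloc(&out, sizeof(int));
    cudaMemcpy(a, &ha, sizeof(int), cudaMemcpyHostToDevice);
    cudaMemcpy(b, &hb, sizeof(int), cudaMemcpyHostToDevice);
    vec_dot_sketch<<<1, 1>>>(a, b, out);
    cudaMemcpy(&hout, out, sizeof(int), cudaMemcpyDeviceToHost);
    printf("dot = %d\n", hout);  // 4 lanes * (1*2) = 8 on a DP4A-capable build
    cudaFree(a); cudaFree(b); cudaFree(out);
    return 0;
}

Compiling this with nvcc -arch=sm_52 guard_sketch.cu and running it on an RTX 3090 reproduces the failure mode: the card JIT-compiles the sm_52 PTX, which contains only the error branch, while nvcc -arch=sm_86 takes the DP4A path.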


DanCard commented Feb 24, 2024

This is no longer an issue with the latest update.

DanCard closed this as completed Feb 24, 2024
szymonrucinski commented

Hey, I still keep getting this error for the server. I am on the main branch as well.
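
If this still reproduces on current main, one thing worth trying is a clean rebuild with an explicit target architecture, so the build cannot silently fall back to nvcc's sm_52 default. This is a sketch assuming the Makefile of that era (the LLAMA_CUBLAS flag and the CUDA_DOCKER_ARCH override are assumptions; check the Makefile in your checkout for the exact variable names), with sm_86 matching an RTX 3090:

make clean
LLAMA_CUBLAS=1 CUDA_DOCKER_ARCH=sm_86 make

Posting the output of nvcc --version and the exact build command would also help narrow down whether this is the same arch-flag regression or a different problem.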
