
Kernel error for running example.py #1

Closed
zhuole1025 opened this issue Nov 8, 2024 · 5 comments

Comments

@zhuole1025

Hi, thanks for this amazing work! When running the given example, I got the following error: `RuntimeError: CUDA error: no kernel image is available for execution on the device` (at `/data/nunchaku/src/kernels/awq/gemv_awq.cu:312`). I have followed the installation instructions using torch 2.4.1.

@sxtyzhangzk
Collaborator

Hi, may I ask which GPU you are using? We currently support sm_86 (Ampere, RTX 3090/A6000) and sm_89 (Ada, RTX 4090). The kernel may run on sm_80 (A100), but expect a significant performance drop. If you want to try it on an A100, you can edit `setup.py` and change `arch=compute_86,code=sm_86` to `arch=compute_80,code=sm_80`.
Unfortunately, we don't support Turing (RTX 20 series) and earlier architectures, since we depend on FlashAttention. Hopper (H100) also does not work due to the lack of an INT4 TensorCore.
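For anyone checking their own card, the support matrix described above can be sketched as a small helper. This is a hypothetical snippet, not part of nunchaku; on a live machine you would pass in `torch.cuda.get_device_capability()` to get the `(major, minor)` tuple:

```python
# Hypothetical helper: map a CUDA compute capability (major, minor)
# to the support status described in this thread. Not part of nunchaku.
SUPPORTED = {(8, 6), (8, 9)}   # Ampere (RTX 3090 / A6000), Ada (RTX 4090)
PARTIAL = {(8, 0)}             # A100: may run, with a significant perf drop

def support_status(capability):
    """Return a human-readable support status for an sm_XY capability."""
    if capability in SUPPORTED:
        return "supported"
    if capability in PARTIAL:
        return "partial (expect a significant performance drop)"
    if capability < (8, 0):
        return "unsupported (pre-Ampere: FlashAttention dependency)"
    return "unsupported (e.g. Hopper: no INT4 TensorCore)"

# On a machine with PyTorch + CUDA, you would call:
#   status = support_status(torch.cuda.get_device_capability())
print(support_status((8, 9)))  # → supported
print(support_status((9, 0)))  # → unsupported (e.g. Hopper: no INT4 TensorCore)
```

The tuple comparison `capability < (8, 0)` works because Python compares tuples element-wise, so sm_75 (Turing) sorts below sm_80.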

@zhuole1025
Author

Thanks for your explanation. Such a pity since I am using H100..

@Ph0rk0z

Ph0rk0z commented Nov 9, 2024

Can you add an option for xformers or SDPA? If you used the AWQ kernel that supports older cards, that's all it would take. Are the weights just standard GEMV, or is it a custom format?

@bghira

bghira commented Nov 11, 2024

You may as well use a different inference engine if you're talking about using more generic kernels.

@Ph0rk0z

Ph0rk0z commented Nov 12, 2024

There aren't a lot of kernels to choose from, sadly. Custom kernels seem like the way to go for these transformer-based models, just as on the LLM side. Unfortunately, everyone is using Ampere as the baseline. AWQ does have kernels working for earlier architectures, and there are other attention mechanisms available.
