Skip to content

[Kernel] Refactor FP8 kv-cache with NVIDIA float8_e4m3 support (#4535) #5

[Kernel] Refactor FP8 kv-cache with NVIDIA float8_e4m3 support (#4535)

[Kernel] Refactor FP8 kv-cache with NVIDIA float8_e4m3 support (#4535) #5

Annotations

2 warnings

ruff (3.11)

succeeded May 10, 2024 in 7s