Skip to content

[Kernel] Optimize FP8 support for MoE kernel / Mixtral via static scales#4343

Merged
robertgshaw2-neuralmagic merged 33 commits intovllm-project:mainfrom pcmoritz:mixtral-fp8-staticApr 27, 2024