[Kernel] Optimize FP8 support for MoE kernel / Mixtral via static scales#4343
Merged
robertgshaw2-neuralmagic merged 33 commits intovllm-project:main from pcmoritz:mixtral-fp8-staticApr 27, 2024
+95-18
Commits
Commits on Apr 24, 2024
- committed
- committed
- committed
- committed
- committed
- committed
- committed
- committed
- committed
- committed
- committed
- committed
- committed
Commits on Apr 25, 2024
- committed
- committed
- committed
- committed
- committed
- committed
- committed
- committed
- committed
- committed
- committed