diff --git a/README.md b/README.md index 15fa53fa84a..989095cec77 100644 --- a/README.md +++ b/README.md @@ -120,7 +120,8 @@ q_model = fit( SmoothQuant - Weight-Only Quantization (INT8/INT4/FP4/NF4) + Weight-Only Quantization (INT8/INT4/FP4/NF4) + FP8 Quantization