Add to README a command for export of diffusion models with hybrid quantization #1228

nikita-savelyevv · 2024-11-19T12:14:44Z

Added a command for exporting quantized diffusion models similar to how it is done for LMs.

AlexKoff88 · 2024-11-19T12:26:53Z

README.md

@@ -163,6 +163,9 @@ For more examples check out our [LLM Inference Guide](https://docs.openvino.ai/2
 ```sh
 #Download and convert to OpenVINO dreamlike-anime-1.0 model
 optimum-cli export openvino --model dreamlike-art/dreamlike-anime-1.0 --task stable-diffusion --weight-format fp16 dreamlike_anime_1_0_ov/FP16
+
+#Download, convert to OpenVINO and apply int8 hybrid quantization to dreamlike-anime-1.0 model


Maybe explicitly state in the comment that this is an alternative, e.g.:
"Or use INT8 hybrid quantization to optimize and speed up the model", or
"You can also use INT8 hybrid quantization to further optimize the model and reduce inference latency", etc.
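
For context, here is a sketch of what the hybrid-quantization export could look like with the suggested comment wording applied. The diff excerpt above cuts off before the added command, so this is not the exact line from the PR. It assumes that `optimum-cli` selects hybrid quantization for diffusion pipelines when `--weight-format int8` is combined with a calibration `--dataset`, and both the `conceptual_captions` dataset and the `INT8` output directory are assumptions rather than values taken from the PR. The snippet only prints the command so it can be checked before running.

```sh
# Model ID taken from the existing FP16 example in the README.
MODEL_ID="dreamlike-art/dreamlike-anime-1.0"
# Hypothetical output directory, named by analogy with the FP16 one.
OUT_DIR="dreamlike_anime_1_0_ov/INT8"

# Alternatively, use INT8 hybrid quantization to further optimize the model
# and reduce inference latency.
# Assumption: --weight-format int8 together with --dataset triggers hybrid
# quantization for diffusion models; the dataset name is also an assumption.
EXPORT_CMD="optimum-cli export openvino --model $MODEL_ID --task stable-diffusion --weight-format int8 --dataset conceptual_captions $OUT_DIR"

# Dry run: print the command instead of executing it.
echo "$EXPORT_CMD"
```

To perform the export, paste the printed command into a shell where `optimum-cli` is installed. Unlike the FP16 export, this downloads the calibration dataset and so takes longer.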

…antization (openvinotoolkit#1228) Added a command for exporting quantized diffusion models similar to how it is done for LMs.

**Ported:** - #1187 - #1189 - #1192 - #1196 - #1202 - #1204 - #1210 - #1217 - #1218 - #1221 - #1222 - #1228

Add to README a command for export of SD model with hybrid quantization

709d243

github-actions bot added the "category: sampling" label (Sampling / Decoding algorithms) Nov 19, 2024

nikita-savelyevv requested a review from MaximProshin November 19, 2024 12:17

MaximProshin requested review from AlexKoff88 and ilya-lavrenov November 19, 2024 12:18

MaximProshin assigned ilya-lavrenov Nov 19, 2024

AlexKoff88 reviewed Nov 19, 2024

ilya-lavrenov added the "port to LTS" label (PR needs to be ported to LTS) and removed the "category: sampling" label (Sampling / Decoding algorithms) Nov 19, 2024

MaximProshin approved these changes Nov 19, 2024

github-actions bot added the "category: sampling" label (Sampling / Decoding algorithms) Nov 19, 2024

Update comment

d17059a

AlexKoff88 approved these changes Nov 19, 2024

ilya-lavrenov added this pull request to the merge queue Nov 19, 2024

ilya-lavrenov added this to the 2025.0 milestone Nov 19, 2024

Merged via the queue into openvinotoolkit:master with commit f78001d Nov 19, 2024
51 checks passed

ilya-lavrenov mentioned this pull request Nov 20, 2024

Port fixes from master to 2024.5.1 / 2024.6.0 #1239

Merged

github-merge-queue bot pushed a commit that referenced this pull request Nov 21, 2024

Port fixes from master to 2024.5.1 / 2024.6.0 (#1239)

da7a7ca

**Ported:** - #1187 - #1189 - #1192 - #1196 - #1202 - #1204 - #1210 - #1217 - #1218 - #1221 - #1222 - #1228

ilya-lavrenov removed the "port to LTS" label (PR needs to be ported to LTS) Nov 21, 2024
