Add to README a command for export of diffusion models with hybrid quantization #1228

nikita-savelyevv
Contributor

Added a command for exporting quantized diffusion models similar to how it is done for LMs.

README.md Outdated
@@ -163,6 +163,9 @@ For more examples check out our [LLM Inference Guide](https://docs.openvino.ai/2
```sh
#Download and convert to OpenVINO dreamlike-anime-1.0 model
optimum-cli export openvino --model dreamlike-art/dreamlike-anime-1.0 --task stable-diffusion --weight-format fp16 dreamlike_anime_1_0_ov/FP16

#Download, convert to OpenVINO and apply int8 hybrid quantization to dreamlike-anime-1.0 model
Collaborator

Maybe explicitly state in the comment that this is an alternative, e.g. "Or use INT8 hybrid quantization to optimize and speed up the model", or "Alternatively, you can use INT8 hybrid quantization to further optimize the model and reduce inference latency".

Contributor Author

Updated
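
For reference, a minimal sketch of what the hybrid-quantization export could look like, modeled on the FP16 command in the diff above. The `--dataset conceptual_captions` option and the `dreamlike_anime_1_0_ov/INT8` output directory are assumptions and are not taken from this thread; check `optimum-cli export openvino --help` for the options your installed version supports. The script only prints the command unless `RUN_EXPORT=1` is set, since a real export downloads the model and runs calibration.

```sh
#!/bin/sh
# Sketch only: assemble an INT8 hybrid-quantization export command.
# Model id and task come from the FP16 command in the diff.
MODEL="dreamlike-art/dreamlike-anime-1.0"
# Assumption: calibration dataset name (hybrid quantization needs one).
DATASET="conceptual_captions"
# Assumption: output directory, mirroring the FP16 path.
OUT_DIR="dreamlike_anime_1_0_ov/INT8"

# Store the command as positional parameters so quoting is preserved.
set -- optimum-cli export openvino \
    --model "$MODEL" \
    --task stable-diffusion \
    --weight-format int8 \
    --dataset "$DATASET" \
    "$OUT_DIR"

# Keep a flat copy for logging or inspection.
CMD="$*"

if [ "${RUN_EXPORT:-0}" = "1" ]; then
    "$@"            # actually run the export
else
    echo "$CMD"     # dry run: just show what would be executed
fi
```

Running the script without `RUN_EXPORT` set prints the assembled command so it can be reviewed before committing to a long export.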

ilya-lavrenov added the "port to LTS" label (PR needs to be ported to LTS) and removed the "category: sampling" label (Sampling / Decoding algorithms) on Nov 19, 2024
github-actions bot added the "category: sampling" label (Sampling / Decoding algorithms) on Nov 19, 2024
ilya-lavrenov added this pull request to the merge queue on Nov 19, 2024
ilya-lavrenov added this to the 2025.0 milestone on Nov 19, 2024
Merged via the queue into openvinotoolkit:master with commit f78001d Nov 19, 2024
51 checks passed
ilya-lavrenov pushed a commit to ilya-lavrenov/openvino.genai that referenced this pull request Nov 20, 2024
…antization (openvinotoolkit#1228)

Added a command for exporting quantized diffusion models similar to how
it is done for LMs.
github-merge-queue bot pushed a commit that referenced this pull request Nov 21, 2024
ilya-lavrenov removed the "port to LTS" label (PR needs to be ported to LTS) on Nov 21, 2024