From eb7adc22bf52288549fff26482b03ca1c496c7b9 Mon Sep 17 00:00:00 2001 From: dlyakhov Date: Thu, 14 Mar 2024 13:39:41 +0100 Subject: [PATCH 1/3] Fix broken link --- docs/compression_algorithms/CompressWeights.md | 2 +- 1 file changed, 1 insertion(+), 1 deletion(-) diff --git a/docs/compression_algorithms/CompressWeights.md b/docs/compression_algorithms/CompressWeights.md index f571c5e9296..df598422df3 100644 --- a/docs/compression_algorithms/CompressWeights.md +++ b/docs/compression_algorithms/CompressWeights.md @@ -400,7 +400,7 @@ This modification applies only for patterns `MatMul-Multiply-MatMul` (for exampl #### Additional resources - [LLM Weight Compression](https://docs.openvino.ai/2024/openvino-workflow/model-optimization-guide/weight-compression.html) -- [Optimize and Deploy Generative AI Models using Hugging Face Optimum Intel](https://docs.openvino.ai/2024/openvino-workflow/generative-ai-models-guide.html) +- [Optimize and Deploy Generative AI Models using Hugging Face Optimum Intel](https://docs.openvino.ai/2024/learn-openvino/llm_inference_guide.html) - [Optimum Intel documentation](https://huggingface.co/docs/optimum/intel/inference) - [Large Language Models Weight Compression Example](https://github.com/openvinotoolkit/nncf/blob/develop/examples/llm_compression/openvino/tiny_llama) - [Tuning Ratio and Group Size Example](https://github.com/openvinotoolkit/nncf/blob/develop/examples/llm_compression/openvino/tiny_llama_find_hyperparams) From eae38be280b915d3145074cdb96fe864520abd59 Mon Sep 17 00:00:00 2001 From: dlyakhov Date: Thu, 14 Mar 2024 13:42:27 +0100 Subject: [PATCH 2/3] Description is updated --- docs/compression_algorithms/CompressWeights.md | 2 +- 1 file changed, 1 insertion(+), 1 deletion(-) diff --git a/docs/compression_algorithms/CompressWeights.md b/docs/compression_algorithms/CompressWeights.md index df598422df3..a79dcec3bfe 100644 --- a/docs/compression_algorithms/CompressWeights.md +++ b/docs/compression_algorithms/CompressWeights.md @@ -400,7 +400,7 @@ This modification applies only for patterns `MatMul-Multiply-MatMul` (for exampl #### Additional resources - [LLM Weight Compression](https://docs.openvino.ai/2024/openvino-workflow/model-optimization-guide/weight-compression.html) -- [Optimize and Deploy Generative AI Models using Hugging Face Optimum Intel](https://docs.openvino.ai/2024/learn-openvino/llm_inference_guide.html) +- [Large Language Model Inference Guide](https://docs.openvino.ai/2024/learn-openvino/llm_inference_guide.html) - [Optimum Intel documentation](https://huggingface.co/docs/optimum/intel/inference) - [Large Language Models Weight Compression Example](https://github.com/openvinotoolkit/nncf/blob/develop/examples/llm_compression/openvino/tiny_llama) - [Tuning Ratio and Group Size Example](https://github.com/openvinotoolkit/nncf/blob/develop/examples/llm_compression/openvino/tiny_llama_find_hyperparams) From 4cb3907837aa249e80b7e093e2f03ca43b990d28 Mon Sep 17 00:00:00 2001 From: Daniil Lyakhov Date: Thu, 14 Mar 2024 16:03:51 +0100 Subject: [PATCH 3/3] Update docs/compression_algorithms/CompressWeights.md Co-authored-by: Lyalyushkin Nikolay --- docs/compression_algorithms/CompressWeights.md | 1 + 1 file changed, 1 insertion(+) diff --git a/docs/compression_algorithms/CompressWeights.md b/docs/compression_algorithms/CompressWeights.md index a79dcec3bfe..bc53948441b 100644 --- a/docs/compression_algorithms/CompressWeights.md +++ b/docs/compression_algorithms/CompressWeights.md @@ -401,6 +401,7 @@ This modification applies only for patterns `MatMul-Multiply-MatMul` (for exampl - [LLM Weight Compression](https://docs.openvino.ai/2024/openvino-workflow/model-optimization-guide/weight-compression.html) - [Large Language Model Inference Guide](https://docs.openvino.ai/2024/learn-openvino/llm_inference_guide.html) +- [Inference with Hugging Face and Optimum Intel](https://docs.openvino.ai/2024/learn-openvino/llm_inference_guide/llm-inference-hf.html) - [Optimum Intel documentation](https://huggingface.co/docs/optimum/intel/inference) - [Large Language Models Weight Compression Example](https://github.com/openvinotoolkit/nncf/blob/develop/examples/llm_compression/openvino/tiny_llama) - [Tuning Ratio and Group Size Example](https://github.com/openvinotoolkit/nncf/blob/develop/examples/llm_compression/openvino/tiny_llama_find_hyperparams)