Enable CPU Speculative Mixtral #10497

Uxito-Ada · 2024-03-21T07:30:16Z

Description

Speculative Mixtral on CPU, which is tested with mistralai/Mixtral-8x7B-Instruct-v0.1 and mistralai/Mixtral-8x7B-v0.1 on SPR.

1. Why the change?

as above

2. User API changes

no

3. Summary of the change

Enable CPU Speculative Mixtral

4. How to test?

glorysdj · 2024-03-26T00:14:17Z

python/llm/example/CPU/Speculative-Decoding/mixtral/README.md

@@ -0,0 +1,86 @@
+# Mixtral
+In this directory, you will find examples on how you could run Mixtral BF16 inference with self-speculative decoding using BigDL-LLM on [Intel CPUs](../README.md). For illustration purposes,we utilize the [mistralai/Mixtral-8x7B-Instruct-v0.1](https://huggingface.co/mistralai/Mixtral-8x7B-Instruct-v0.1) and [mistralai/Mixtral-8x7B-v0.1](https://huggingface.co/mistralai/Mixtral-8x7B-v0.1) as reference Mixtral models.


change all bigdl-llm to ipex-llm

Uxito-Ada · 2024-03-26T01:25:10Z

migrate to #10539

Enable CPU Speculative Mixtral

aaf3452

Uxito-Ada requested review from qiyuangong and glorysdj March 21, 2024 07:30

glorysdj reviewed Mar 26, 2024

View reviewed changes

Uxito-Ada closed this Mar 26, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Enable CPU Speculative Mixtral #10497

Enable CPU Speculative Mixtral #10497

Uxito-Ada commented Mar 21, 2024

glorysdj Mar 26, 2024

Uxito-Ada commented Mar 26, 2024

		@@ -0,0 +1,86 @@
		# Mixtral
		In this directory, you will find examples on how you could run Mixtral BF16 inference with self-speculative decoding using BigDL-LLM on [Intel CPUs](../README.md). For illustration purposes,we utilize the [mistralai/Mixtral-8x7B-Instruct-v0.1](https://huggingface.co/mistralai/Mixtral-8x7B-Instruct-v0.1) and [mistralai/Mixtral-8x7B-v0.1](https://huggingface.co/mistralai/Mixtral-8x7B-v0.1) as reference Mixtral models.

Enable CPU Speculative Mixtral #10497

Enable CPU Speculative Mixtral #10497

Conversation

Uxito-Ada commented Mar 21, 2024

Description

1. Why the change?

2. User API changes

3. Summary of the change

4. How to test?

glorysdj Mar 26, 2024

Choose a reason for hiding this comment

Uxito-Ada commented Mar 26, 2024