transformers==4.37, yi & yuan2 & vicuna #11805

JinheTang · 2024-08-15T02:38:48Z

Added sample output for yi & vicuna.
Updated the vicuna example models to 7b-v1.5 & 13b-v1.5.
Added the yi-6b-chat model to yi.

rnwang04 · 2024-08-15T02:48:47Z

python/llm/example/GPU/HuggingFace/LLM/vicuna/README.md

 ```log
-Inference time: xxxx s
+Inference time: 1.0269405841827393 s


Don't show real performance data in our examples; just keep `Inference time: xxxx s`.

rnwang04 · 2024-08-15T02:49:08Z

python/llm/example/GPU/HuggingFace/LLM/vicuna/README.md

 ```log
-Inference time: xxxx s
+Inference time: 0.7162051200866699 s


Just keep `Inference time: xxxx s`.

rnwang04 · 2024-08-15T02:49:52Z

python/llm/example/GPU/HuggingFace/LLM/yi/README.md

@@ -112,18 +112,51 @@ python ./generate.py

 In the example, several arguments can be passed to satisfy your requirements:

- `--repo-id-or-model-path REPO_ID_OR_MODEL_PATH`: argument defining the huggingface repo id for the Yi model (e.g. `01-ai/Yi-6B`) to be downloaded, or the path to the huggingface checkpoint folder. It is default to be `'01-ai/Yi-6B'`.
+- `--repo-id-or-model-path REPO_ID_OR_MODEL_PATH`: argument defining the huggingface repo id for the Yi model (e.g. `01-ai/Yi-6B`) to be downloaded, or the path to the huggingface checkpoint folder. It is default to be `'01-ai/Yi-6B-Chat'`.


Mention both models in the example: (e.g. `01-ai/Yi-6B` and `01-ai/Yi-6B-Chat`)

rnwang04 · 2024-08-15T02:50:26Z

python/llm/example/GPU/HuggingFace/LLM/yi/README.md

 - `--prompt PROMPT`: argument defining the prompt to be infered (with integrated prompt format for chat). It is default to be `'AI是什么？'`.
 - `--n-predict N_PREDICT`: argument defining the max number of tokens to predict. It is default to be `32`.

 #### Sample Output
 #### [01-ai/Yi-6B](https://huggingface.co/01-ai/Yi-6B)

 ```log
-Inference time: xxxx s
+Inference time: 1.1255202293395996 s


Just keep `Inference time: xxxx s`.

rnwang04 · 2024-08-15T02:50:36Z

python/llm/example/GPU/HuggingFace/LLM/yi/README.md

+
+#### [01-ai/Yi-6B-Chat](https://huggingface.co/01-ai/Yi-6B-Chat)
+```log
+Inference time: 0.5318927764892578 s


Just keep `Inference time: xxxx s`.
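The request above (keep the placeholder instead of a measured time) can be applied mechanically when pasting a real run into a README. The following is a minimal sketch, not part of this PR: the helper name `mask_inference_time` is hypothetical, and it assumes the log line has the exact shape `Inference time: <number> s` shown in the diffs.

```python
import re

# Hypothetical helper (not from the PR): replace a measured inference time
# with the "xxxx" placeholder used in the example READMEs.
_TIME_LINE = re.compile(r"(Inference time: )[0-9.]+( s)")


def mask_inference_time(log_text: str) -> str:
    """Return log_text with every measured inference time replaced by 'xxxx'."""
    return _TIME_LINE.sub(r"\1xxxx\2", log_text)


masked = mask_inference_time("Inference time: 1.0269405841827393 s")
print(masked)
```

Lines that already contain `xxxx` are left unchanged, since the pattern only matches digits and dots.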

rnwang04 · 2024-08-15T02:51:10Z

python/llm/example/GPU/HuggingFace/LLM/yi/generate.py

@@ -32,10 +32,10 @@

 if __name__ == '__main__':
    parser = argparse.ArgumentParser(description='Predict Tokens using `generate()` API for Yi model')
-    parser.add_argument('--repo-id-or-model-path', type=str, default="01-ai/Yi-6B",
+    parser.add_argument('--repo-id-or-model-path', type=str, default="/home/arda/jinhe/Yi-6B-Chat",


Use the repo id rather than a local path: `default="01-ai/Yi-6B-Chat"`

rnwang04 · 2024-08-15T03:20:00Z

python/llm/example/GPU/HuggingFace/LLM/yi/generate.py

@@ -32,10 +32,10 @@

 if __name__ == '__main__':
    parser = argparse.ArgumentParser(description='Predict Tokens using `generate()` API for Yi model')
-    parser.add_argument('--repo-id-or-model-path', type=str, default="01-ai/Yi-6B",
+    parser.add_argument('--repo-id-or-model-path', type=str, default="/01-ai/Yi-6B-Chat",


Drop the leading slash: `default="01-ai/Yi-6B-Chat"`
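For reference, the two review comments on this argument amount to the default shown below. This is a hedged sketch rather than the merged `generate.py`: `build_parser` and `looks_like_repo_id` are hypothetical names introduced for illustration, and the regular expression is a simplified assumption about the `namespace/name` form of a Hugging Face repo id.

```python
import argparse
import re

# Hypothetical check (not from the PR): a repo id such as "01-ai/Yi-6B-Chat"
# has the form "namespace/name" with no leading slash. This is a
# simplification; it is only meant to catch the two mistakes flagged in review.
_REPO_ID = re.compile(r"^[\w.-]+/[\w.-]+$")


def looks_like_repo_id(value: str) -> bool:
    """Return True if value has the simple 'namespace/name' shape."""
    return bool(_REPO_ID.match(value))


def build_parser() -> argparse.ArgumentParser:
    """Build a parser whose default is the repo id requested in review."""
    parser = argparse.ArgumentParser(
        description='Predict Tokens using `generate()` API for Yi model')
    parser.add_argument('--repo-id-or-model-path', type=str,
                        default="01-ai/Yi-6B-Chat",
                        help='Hugging Face repo id or local checkpoint folder')
    return parser


# Parse an empty argument list so that the default value applies.
args = build_parser().parse_args([])
print(args.repo_id_or_model_path)
```

Under this check, both rejected defaults (`"/home/arda/jinhe/Yi-6B-Chat"` and `"/01-ai/Yi-6B-Chat"`) fail, while `"01-ai/Yi-6B-Chat"` passes. A user-supplied local checkpoint folder is still a valid value for the argument; the check only describes what a sensible default looks like.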

rnwang04 · 2024-08-15T05:28:06Z

python/llm/example/GPU/HuggingFace/LLM/yi/generate.py

@@ -22,20 +22,13 @@
 from transformers import AutoTokenizer

 # Refer to https://huggingface.co/01-ai/Yi-6B-Chat#31-use-the-chat-model


Also delete this line.

rnwang04

LGTM

JinheTang added 3 commits August 15, 2024 09:51

transformers==4.37

6493736

added yi model

8a5f338

added yi model

b06c51f

JinheTang changed the title ~~transformers==4.37, yuan2 & vicuna~~ transformers==4.37, yi & yuan2 & vicuna Aug 15, 2024

rnwang04 reviewed Aug 15, 2024


xxxx

9aca692

rnwang04 reviewed Aug 15, 2024


delete prompt template

504f243

rnwang04 reviewed Aug 15, 2024


/ and delete

876eb20

rnwang04 approved these changes Aug 15, 2024


rnwang04 merged commit 2fbbb51 into intel-analytics:main Aug 15, 2024
1 check passed
