Update Llama2 multi-processes example #11852

sgwhat · 2024-08-19T10:25:23Z

Description

Modify the prompt handling section in the llama2 example for #11787.

1. Why the change?

Support users in providing their own prompts to improve response quality.

2. User API changes

N/A

3. Summary of the change

4. How to test?

MTL
LNL

plusbang · 2024-08-19T10:31:24Z

python/llm/example/NPU/HF-Transformers-AutoModels/LLM/llama2.py

-            import random
-            idx = random.randint(0, 2)
-            prompt = prompts[idx]
+            prompt = get_prompt(args.prompt, [], system_prompt=DEFAULT_SYSTEM_PROMPT)


I think we also need to update sample output in readme. Other LGTM.

jason-dai · 2024-08-19T10:48:05Z

python/llm/example/NPU/HF-Transformers-AutoModels/LLM/README.md

 - `--n-predict N_PREDICT`: argument defining the max number of tokens to predict. It is default to be `32`.
+- `--max-output-len MAX_OUTPUT_LEN`: Defines the maximum sequence length for both input and output tokens. It is default to be `1024`.
+- `--max-prompt-len MAX_PROMPT_LEN`: Defines the maximum number of tokens that the input prompt can contain. It is default to be `128`.


why 128? Maybe 512 or 768

changed to 768.

sgwhat added 2 commits August 19, 2024 18:23

update llama2 multi-processes examples

e6ecb69

update

aaa6352

sgwhat requested a review from plusbang August 19, 2024 10:27

plusbang approved these changes Aug 19, 2024

View reviewed changes

update readme

d2742a8

jason-dai reviewed Aug 19, 2024

View reviewed changes

update

4cdb867

sgwhat merged commit 7380823 into intel-analytics:main Aug 19, 2024
1 check passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Update Llama2 multi-processes example #11852

Update Llama2 multi-processes example #11852

sgwhat commented Aug 19, 2024 •

edited

Loading

plusbang Aug 19, 2024

jason-dai Aug 19, 2024

sgwhat Aug 19, 2024

Update Llama2 multi-processes example #11852

Update Llama2 multi-processes example #11852

Conversation

sgwhat commented Aug 19, 2024 • edited Loading

Description

1. Why the change?

2. User API changes

3. Summary of the change

4. How to test?

plusbang Aug 19, 2024

Choose a reason for hiding this comment

jason-dai Aug 19, 2024

Choose a reason for hiding this comment

sgwhat Aug 19, 2024

Choose a reason for hiding this comment

sgwhat commented Aug 19, 2024 •

edited

Loading