Update doc for running npu generate example with ipex-llm[npu] #11876

sgwhat · 2024-08-21T02:15:57Z

Description

1. Why the change?

For NPU generate example, switch npu installation from pip install ipex-llm[all] pip install intel-npu-acceleration-library==1.3 to our new release pip install ipex-llm[npu].

2. User API changes

Installation changes as above.

3. Summary of the change

Document changes.

4. How to test?

Application test

plusbang · 2024-08-21T05:44:03Z

python/llm/example/NPU/HF-Transformers-AutoModels/LLM/llama2.py

@@ -54,7 +54,7 @@ def get_prompt(message: str, chat_history: list[tuple[str, str]],
                        help='Prompt to infer')
    parser.add_argument("--n-predict", type=int, default=32, help="Max tokens to predict")
    parser.add_argument("--max-output-len", type=int, default=1024)
-    parser.add_argument("--max-prompt-len", type=int, default=768)
+    parser.add_argument("--max-prompt-len", type=int, default=512)


Please also update default value in readme. Other LGTM.

sgwhat added 2 commits August 21, 2024 10:14

update doc for running npu generate example with ipex-llm[npu]

158d065

switch max_prompt_len to 512 to fix compile error on mtl

76d245b

sgwhat requested a review from plusbang August 21, 2024 05:40

plusbang approved these changes Aug 21, 2024

View reviewed changes

update doc

f07f2ca

sgwhat merged commit 8c5c7f3 into intel-analytics:main Aug 21, 2024
1 check passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Update doc for running npu generate example with ipex-llm[npu] #11876

Update doc for running npu generate example with ipex-llm[npu] #11876

sgwhat commented Aug 21, 2024 •

edited

Loading

plusbang Aug 21, 2024

Update doc for running npu generate example with ipex-llm[npu] #11876

Update doc for running npu generate example with ipex-llm[npu] #11876

Conversation

sgwhat commented Aug 21, 2024 • edited Loading

Description

1. Why the change?

2. User API changes

3. Summary of the change

4. How to test?

plusbang Aug 21, 2024

Choose a reason for hiding this comment

sgwhat commented Aug 21, 2024 •

edited

Loading