Update benchmark script for NPU #11932

plusbang · 2024-08-27T03:28:58Z

Description

Update the all-in-one benchmark script so that NPU models can also be benchmarked with the fused decoder-layer optimization.

Usage:
Set optimize_model to True in config.yaml.
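
As a rough illustration of how such a flag might be consumed, the sketch below reads `optimize_model` from an already-parsed config mapping and coerces it to a boolean. The `conf` dict stands in for the parsed config.yaml; the helper name `read_optimize_model` and the default of `False` are assumptions for illustration, not code from this PR.

```python
def read_optimize_model(conf: dict, default: bool = False) -> bool:
    """Return the optimize_model flag from a parsed config mapping.

    Hypothetical helper: the real script may read the flag differently.
    """
    value = conf.get("optimize_model", default)
    # Accept string spellings in case the value was quoted in the config file.
    if isinstance(value, str):
        return value.strip().lower() in {"true", "1", "yes"}
    return bool(value)


# Stand-in for the parsed config.yaml (only this one key comes from the PR text).
conf = {"optimize_model": True}
print(read_optimize_model(conf))
```

Falling back to a default when the key is missing would keep older config files working, if that behavior is wanted.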

4. How to test?

Application test

cyita · 2024-08-27T06:22:08Z

python/llm/dev/benchmark/all-in-one/run.py

                                          torch_dtype='auto', attn_implementation="eager").eval()
        tokenizer = AutoTokenizer.from_pretrained(model_path, trust_remote_code=True)
    elif repo_id in LLAMA_IDS:
        model = AutoModelForCausalLM.from_pretrained(model_path, load_in_low_bit=low_bit, trust_remote_code=True,
+                                                     optimize_model=optimize_model, max_output_len=max_output_len, max_prompt_len=int(in_out_len[0]), transpose_value_cache=True,


Should we add torch_dtype=torch.float16 as suggested in our example?

I have updated the script accordingly.
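
To make the discussed call easier to follow, here is a minimal sketch of how the keyword arguments from the diff above could be assembled, with an optional `torch_dtype` as raised in the review. The helper name `build_npu_load_kwargs`, the `"1024-128"` input/output pair format, and the sample values are assumptions for illustration; only the keyword names come from the diff.

```python
def build_npu_load_kwargs(in_out, low_bit, optimize_model, max_output_len,
                          torch_dtype=None):
    """Collect keyword arguments for the NPU from_pretrained call.

    Hypothetical helper; `in_out` is assumed to look like "1024-128"
    (prompt length, then output length).
    """
    in_out_len = in_out.split("-")
    kwargs = {
        "load_in_low_bit": low_bit,
        "trust_remote_code": True,
        "optimize_model": optimize_model,
        "max_output_len": max_output_len,
        # The prompt length is the first half of the in/out pair, as in the diff.
        "max_prompt_len": int(in_out_len[0]),
        "transpose_value_cache": True,
    }
    # Only pass torch_dtype when the caller supplies one (e.g. torch.float16).
    if torch_dtype is not None:
        kwargs["torch_dtype"] = torch_dtype
    return kwargs


# Sample values below are placeholders, not settings taken from the PR.
kwargs = build_npu_load_kwargs("1024-128", "sym_int4", True, 1152)
print(kwargs)
```

The resulting dict would then be unpacked into the load call, for example `AutoModelForCausalLM.from_pretrained(model_path, **kwargs)`, with `torch_dtype=torch.float16` supplied by the caller once `torch` is imported.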

plusbang added 7 commits August 27, 2024 10:52

update

9c8b052

small fix

4fcffc4

align script

03bbe71

small fix

2df0a37

revert readme

39cc26c

revert readme

d2f8c0d

fix

a4a982b

plusbang requested review from sgwhat and cyita August 27, 2024 05:39

sgwhat approved these changes Aug 27, 2024

cyita reviewed Aug 27, 2024

add dtype

139a7bd

plusbang merged commit 7c8c9a0 into intel-analytics:main Aug 27, 2024
1 check passed
