Skip to content

Commit

Permalink
modify the path of script
Browse files Browse the repository at this point in the history
  • Loading branch information
WeiguangHan committed Oct 17, 2023
1 parent f0f15f1 commit e33668f
Show file tree
Hide file tree
Showing 2 changed files with 570 additions and 0 deletions.
22 changes: 22 additions & 0 deletions python/llm/test/benchmark/gpu/arc-perf-test.yaml
Original file line number Diff line number Diff line change
@@ -0,0 +1,22 @@
repo_id:
- 'THUDM/chatglm2-6b'
- 'meta-llama/Llama-2-7b-chat-hf'
- 'meta-llama/Llama-2-13b-chat-hf'
- 'tiiuae/falcon-7b-instruct-with-patch'
- 'mosaicml/mpt-7b-chat'
local_model_hub: '/mnt/disk1/models'
warm_up: 1
num_trials: 3
num_beams: 1 # default to greedy search
in_out_pairs:
- '32-32'
- '1024-128'
- '2048-256'
test_api:
# - "transformer_int4"
# - "native_int4"
# - "optimize_model"
# - "pytorch_autocast_bf16"
# - "ipex_fp16_gpu" # on Intel GPU
- "transformer_int4_gpu" # on Intel GPU
# - "optimize_model_gpu" # on Intel GPU
Loading

0 comments on commit e33668f

Please sign in to comment.