Habana-LLM-Viewer is a tool that provides Roofline modeling, LLM performance prediction, and memory analysis for the Intel Gaudi platform. Inspired by LLM-Viewer, Habana-LLM-Viewer can be used to estimate the performance of models such as Llama2-13B, Qwen-7B, and Mixtral-8x7B on Intel Gaudi.
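The core idea behind a Roofline-based projection can be sketched in a few lines: a kernel's time is bounded by either peak compute or peak memory bandwidth, whichever is slower. The peak numbers below are placeholder assumptions for illustration, not official Intel Gaudi 2 specifications, and the tool's actual cost model may be more detailed.

```python
# Minimal Roofline sketch. PEAK_* values are illustrative assumptions,
# not official Intel Gaudi 2 specs.
PEAK_COMPUTE_FLOPS = 432.0e12   # assumed peak BF16 compute, FLOP/s
PEAK_HBM_BW_BPS = 2.45e12       # assumed HBM bandwidth, bytes/s

def roofline_time_s(flops, bytes_moved):
    """Projected kernel time: the max of compute-bound and memory-bound time."""
    t_compute = flops / PEAK_COMPUTE_FLOPS
    t_memory = bytes_moved / PEAK_HBM_BW_BPS
    return max(t_compute, t_memory)
```

A kernel with high arithmetic intensity (FLOPs per byte) lands on the compute roof; one that mostly streams data from HBM lands on the bandwidth roof.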
- Simply run habana_viewer.py and the results will be served on localhost.

```shell
python habana_viewer.py
```
- Simply run run_model_projection.py and the results will be saved to the folder "data/model".

```shell
python run_model_projection.py \
    --device IntelGaudi2 \
    --device-type B \
    --model Llama2-7B \
    --data-type BF16 \
    --batch-size BATCH_SIZE \
    --context-input CONTEXT_INPUT \
    --context-output CONTEXT_OUTPUT \
    --kvcache-bucket 256 \
    --vec-bmm
```
| Model Name | Projected Data |
|---|---|
| Llama2-7B | Link |
| Llama2-13B | Link |
| Llama3-8B | Link |
| Qwen-7B | Link |
| Qwen-14B | Link |
| Mixtral-8x7B | Link |
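The `--kvcache-bucket 256` option suggests that KV-cache lengths are bucketed so projections run over a small set of discrete shapes. A minimal sketch of such rounding (my interpretation, not necessarily the tool's exact logic):

```python
def round_to_bucket(context_len, bucket=256):
    # Round the KV-cache length up to the next bucket multiple, so decode
    # steps with nearby context lengths share one projected shape.
    # (Hypothetical helper; the tool's internal bucketing may differ.)
    return ((context_len + bucket - 1) // bucket) * bucket
```

For example, context lengths 1 through 256 would all be projected at 256, and 257 would move to the 512 bucket.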
- Simply run run_op_projection.py and the results will be saved to the folder "data/operation". As with model projection, one can modify proj_cfg in main.

```shell
python run_op_projection.py \
    --device IntelGaudi2 \
    --device-type B \
    --op Matmul \
    --data-type BF16 \
    --m-list m1 m2 ... \
    --n-list n1 n2 ... \
    --k-list k1 k2 ...
```
| Op Name | Projected Data |
|---|---|
| Matmul | Link |
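For a Matmul of shape (m, k) x (k, n), the quantities a projection like this typically starts from are the FLOP count and the minimum memory traffic. A hedged sketch (illustrative only; the tool's actual cost model may account for tiling, reuse, and other effects):

```python
def matmul_stats(m, n, k, dtype_bytes=2):
    """Ideal FLOPs, bytes moved, and arithmetic intensity for an
    (m, k) x (k, n) matmul. dtype_bytes=2 corresponds to BF16."""
    flops = 2 * m * n * k                                # one multiply + one add per MAC
    bytes_moved = dtype_bytes * (m * k + k * n + m * n)  # read A and B, write C, once each
    return flops, bytes_moved, flops / bytes_moved       # intensity in FLOPs/byte
```

Feeding the resulting intensity into a Roofline model tells you whether a given (m, n, k) point is expected to be compute-bound or bandwidth-bound on the device.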
- Currently only single-card performance projection is covered; multi-card / multi-node support is planned.
- More models / operations will be covered.