Benchmarks - Add LLaMA-2 Models #668

Status: Open · wants to merge 19 commits into main
Conversation

@dpower4 (Contributor) commented Nov 19, 2024

Added a LLaMA-2 benchmark with both training and inference, implemented in line with the existing PyTorch model benchmarks such as GPT-2 and LSTM; a sketch of the model construction follows the list below.

  • Dropped Python 3.6 and added Python 3.10 for CPU unit tests
  • Updated the base image 20.12 -> 24.03 (CUDA 12.4) for CUDA unit tests
  • Added a LLaMA FP8 unit test for better code coverage while reducing the memory required
  • Updated the transformers requirement to >= 4.28.0 for LlamaConfig
  • Added LLaMA-2 to TensorRT
  • LLaMA-2 tests were not added to test_tensorrt_inference_performance.py because of the large memory requirement on the worker GPU; the tests were validated separately on a GH200
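For reference, the sketch below shows the kind of model setup such a benchmark performs, using the real transformers (>= 4.28.0) LlamaConfig/LlamaModel APIs. The tiny configuration values and variable names are illustrative only, not the values used by pytorch_llama.py.

```python
# Minimal sketch of constructing and driving a LLaMA model, assuming the
# benchmark follows the same pattern as the existing GPT-2 implementation.
# The small config values are placeholders; LLaMA-2-7B would use
# hidden_size=4096, num_hidden_layers=32, num_attention_heads=32, etc.
import torch
from transformers import LlamaConfig, LlamaModel  # requires transformers >= 4.28.0

config = LlamaConfig(
    vocab_size=32000,
    hidden_size=256,
    num_hidden_layers=2,
    num_attention_heads=4,
    intermediate_size=512,
)
model = LlamaModel(config)

# One forward (inference) step on a random batch, the kind of step a model
# benchmark would time repeatedly.
input_ids = torch.randint(0, config.vocab_size, (1, 128))
with torch.no_grad():
    outputs = model(input_ids)
print(outputs.last_hidden_state.shape)  # torch.Size([1, 128, 256])
```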

@dpower4 requested review from cp5555 and a team as code owners on November 19, 2024 02:53
@abuccts (Member) left a comment

Please run python3 setup.py lint to check the formatting and python3 setup.py format to format the code.

@abuccts changed the title from Feat/llama2 to Benchmarks - Add LLaMA-2 Models on Nov 19, 2024
@dpower4 (Contributor, Author) commented Nov 19, 2024

@abuccts, can I get access to the unit test logs?


codecov bot commented Nov 20, 2024

Codecov Report

Attention: Patch coverage is 36.58537% with 78 lines in your changes missing coverage. Please review.

Project coverage is 84.90%. Comparing base (a8a7bed) to head (0b1da4f).
Report is 3 commits behind head on main.

| Files with missing lines | Patch % | Lines |
|---|---|---|
| ...bench/benchmarks/model_benchmarks/pytorch_llama.py | 32.75% | 78 Missing ⚠️ |
Additional details and impacted files
@@            Coverage Diff             @@
##             main     #668      +/-   ##
==========================================
- Coverage   85.77%   84.90%   -0.87%     
==========================================
  Files          97       98       +1     
  Lines        6925     7116     +191     
==========================================
+ Hits         5940     6042     +102     
- Misses        985     1074      +89     
| Flag | Coverage Δ |
|---|---|
| cpu-python3.10-unit-test | 70.95% <36.06%> (?) |
| cpu-python3.7-unit-test | 70.91% <35.77%> (-0.68%) ⬇️ |
| cpu-python3.8-unit-test | 70.95% <35.83%> (-0.67%) ⬇️ |

Flags with carried forward coverage won't be shown.

☔ View full report in Codecov by Sentry.

@guoshzhao (Contributor) commented:

LGTM, thanks! Please fix the UT failures with Python 3.10. And since the CUDA tests are running on a K80, which is a very old GPU, we can skip the "cuda-unit-test" and just make sure "cpu-unit-test" passes.

/__w/1/s/.eggs/setuptools_scm-8.1.0-py3.10.egg/setuptools_scm/_integration/setuptools.py:92: UserWarning: version of superbench already set
  warnings.warn(f"version of {dist_name} already set")
running lint
tests/analyzer/test_summaryop.py:7: error: Module "numpy" has no attribute "NaN"  [attr-defined]
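The lint error above is NumPy 2.0 fallout: the np.NaN alias was removed in NumPy 2.0, and np.nan is the remaining spelling. A minimal sketch of the fix for test_summaryop.py line 7 follows; the surrounding code and the variable name are assumptions, since the file isn't shown here.

```python
import numpy as np

# Before (flagged by mypy; the alias was removed in NumPy 2.0):
#   fill_value = np.NaN
# After: np.nan is the canonical lowercase name and works on all NumPy versions.
fill_value = np.nan
```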
