Commit

enabled amd implementation for llama2
anandhu-eng committed Oct 19, 2024
1 parent fa0eeca commit 148650d
Showing 2 changed files with 11 additions and 0 deletions.
7 changes: 7 additions & 0 deletions docs/benchmarks/language/llama2-70b.md
@@ -25,3 +25,10 @@ hide:
{{ mlperf_inference_implementation_readme (4, "llama2-70b-99", "neuralmagic") }}

{{ mlperf_inference_implementation_readme (4, "llama2-70b-99.9", "neuralmagic") }}

=== "AMD"
## AMD MLPerf Implementation

{{ mlperf_inference_implementation_readme (4, "llama2-70b-99", "amd") }}

{{ mlperf_inference_implementation_readme (4, "llama2-70b-99.9", "amd") }}
4 changes: 4 additions & 0 deletions main.py
@@ -43,6 +43,10 @@ def mlperf_inference_implementation_readme(spaces, model, implementation, *, imp
        devices = [ "CUDA" ]
        frameworks = [ "TensorRT" ]

    elif implementation == "amd":
        devices = [ "ROCm" ]
        frameworks = [ "pytorch" ]

    elif implementation == "neuralmagic":
        devices = [ "CUDA" ]
        frameworks = [ "pytorch" ]
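
The hunk above adds one more branch to a chain that maps an implementation name to its devices and frameworks. As a rough illustration of that mapping only, here is a minimal table-driven sketch. It is not the code in `main.py`: the names `IMPLEMENTATION_TARGETS` and `targets_for` are hypothetical, and only the `"amd"` and `"neuralmagic"` entries are taken from the diff.

```python
# Hypothetical lookup table (not part of main.py). Only the two entries
# visible in the diff are included; other implementations are omitted.
IMPLEMENTATION_TARGETS = {
    "amd": {"devices": ["ROCm"], "frameworks": ["pytorch"]},
    "neuralmagic": {"devices": ["CUDA"], "frameworks": ["pytorch"]},
}


def targets_for(implementation):
    """Return (devices, frameworks) for a known implementation name."""
    try:
        entry = IMPLEMENTATION_TARGETS[implementation]
    except KeyError:
        # Fail loudly instead of falling through with undefined values.
        raise ValueError(f"unknown implementation: {implementation!r}") from None
    # Return copies so callers cannot mutate the shared table.
    return list(entry["devices"]), list(entry["frameworks"])


devices, frameworks = targets_for("amd")
print(devices, frameworks)
```

A table like this keeps each new implementation to a single added entry, whereas the `elif` chain in the commit keeps each case next to any implementation-specific logic. Which fits better depends on how much per-implementation code the function carries.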