-
-
Notifications
You must be signed in to change notification settings - Fork 5.1k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Benchmark: add H100 suite #6047
Conversation
Can I use this for some fp8 test - especially Mixtral |
@KuntaiDu can you please review this? I think I got it working (see link in the description) by adding bunch of clean up in the shell script |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM. I kill vllm process via pkill pt_main_thread
previously but I guess pkill -9 -f python3
also works.
/workspace/buildkite-agent artifact upload "$RESULTS_FOLDER/*" | ||
|
||
# Use the determined command to annotate and upload artifacts | ||
$BUILDKITE_AGENT_COMMAND annotate --style "info" --context "benchmark-results" < $RESULTS_FOLDER/benchmark_results.md |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
As the A100 benchmark also uses the same context (benchmark-results
), it will overwrite the annotation. Maybe add --append
parameter, or annotate in a different context (like "${gpu_name}-benchmark-results") .
# since we are in container anyway | ||
pkill -9 -f python | ||
pkill -9 -f python3 | ||
|
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I tried pkill pt_main_thread
, but I guess pkill -9 -f python3
also works.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Maybe change the annotation part
/workspace/buildkite-agent artifact upload "$RESULTS_FOLDER/*" | ||
|
||
# Use the determined command to annotate and upload artifacts | ||
$BUILDKITE_AGENT_COMMAND annotate --style "info" --context "h100-benchmark-results" < $RESULTS_FOLDER/benchmark_results.md |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
--context "${gpu_type}-benchmark-results"
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM. We could add FP8 cases in a follow-up PR.
this is awesome, thanks for adding |
Signed-off-by: Alvant <[email protected]>
I have recently added an H100 agent which will be online for 12 hours per day. Let's test it out.
Successful build: https://buildkite.com/vllm/performance-benchmark/builds/4493