Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Benchmark: add H100 suite #6047

Merged
merged 15 commits into from
Jul 11, 2024
Merged

Benchmark: add H100 suite #6047

merged 15 commits into from
Jul 11, 2024

Conversation

simon-mo
Copy link
Collaborator

@simon-mo simon-mo commented Jul 1, 2024

I have recently added an H100 agent which will be online for 12 hours per day. Let's test it out.

Successful build: https://buildkite.com/vllm/performance-benchmark/builds/4493

@robertgshaw2-neuralmagic
Copy link
Collaborator

robertgshaw2-neuralmagic commented Jul 2, 2024

Can I use this for some fp8 test - especially Mixtral

@simon-mo simon-mo requested a review from KuntaiDu July 9, 2024 16:12
@simon-mo
Copy link
Collaborator Author

simon-mo commented Jul 9, 2024

@KuntaiDu can you please review this? I think I got it working (see link in the description) by adding bunch of clean up in the shell script

Copy link
Collaborator

@KuntaiDu KuntaiDu left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM. I kill vllm process via pkill pt_main_thread previously but I guess pkill -9 -f python3 also works.

@KuntaiDu KuntaiDu self-requested a review July 9, 2024 21:13
/workspace/buildkite-agent artifact upload "$RESULTS_FOLDER/*"

# Use the determined command to annotate and upload artifacts
$BUILDKITE_AGENT_COMMAND annotate --style "info" --context "benchmark-results" < $RESULTS_FOLDER/benchmark_results.md
Copy link
Collaborator

@KuntaiDu KuntaiDu Jul 9, 2024

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

As the A100 benchmark also uses the same context (benchmark-results), it will overwrite the annotation. Maybe add --append parameter, or annotate in a different context (like "${gpu_name}-benchmark-results") .

# since we are in container anyway
pkill -9 -f python
pkill -9 -f python3

Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I tried pkill pt_main_thread, but I guess pkill -9 -f python3 also works.

Copy link
Collaborator

@KuntaiDu KuntaiDu left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Maybe change the annotation part

/workspace/buildkite-agent artifact upload "$RESULTS_FOLDER/*"

# Use the determined command to annotate and upload artifacts
$BUILDKITE_AGENT_COMMAND annotate --style "info" --context "h100-benchmark-results" < $RESULTS_FOLDER/benchmark_results.md
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

--context "${gpu_type}-benchmark-results"

Copy link
Collaborator

@comaniac comaniac left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM. We could add FP8 cases in a follow-up PR.

@simon-mo simon-mo merged commit 52b7fcb into vllm-project:main Jul 11, 2024
71 checks passed
@cadedaniel
Copy link
Collaborator

this is awesome, thanks for adding

dtrifiro pushed a commit to opendatahub-io/vllm that referenced this pull request Jul 17, 2024
xjpang pushed a commit to xjpang/vllm that referenced this pull request Jul 24, 2024
Alvant pushed a commit to compressa-ai/vllm that referenced this pull request Oct 26, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

Successfully merging this pull request may close these issues.

5 participants