-
Notifications
You must be signed in to change notification settings - Fork 293
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Resurrect ao benchmark on AWS A100 runner #2561
Closed
Closed
Conversation
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Summary: attempt to fix dependencies - this is no longer compatible with the latest huggingface_hub, see failing test at https://github.com/pytorch/pytorch/actions/runs/11445304501/job/31843081598 Pull Request resolved: #2523 Reviewed By: huydhn Differential Revision: D64711662 Pulled By: wdvr fbshipit-source-id: eed9143e6e0531840a53ba5ab3fad04894727272
Summary: Some fixes for pytorch/pytorch#137602 Pull Request resolved: #2514 Reviewed By: xuzhao9 Differential Revision: D64628614 Pulled By: mikaylagawarecki fbshipit-source-id: edebf25cc6648919d5673a3baeaffdac26e5b91f
huydhn
had a problem deploying
to
docker-s3-upload
December 17, 2024 05:15 — with
GitHub Actions
Error
huydhn
had a problem deploying
to
docker-s3-upload
December 17, 2024 05:15 — with
GitHub Actions
Error
huydhn
had a problem deploying
to
docker-s3-upload
December 17, 2024 05:15 — with
GitHub Actions
Failure
huydhn
had a problem deploying
to
docker-s3-upload
December 17, 2024 05:44 — with
GitHub Actions
Error
huydhn
had a problem deploying
to
docker-s3-upload
December 17, 2024 05:44 — with
GitHub Actions
Error
huydhn
had a problem deploying
to
docker-s3-upload
December 17, 2024 05:47 — with
GitHub Actions
Failure
huydhn
had a problem deploying
to
docker-s3-upload
December 17, 2024 05:48 — with
GitHub Actions
Error
huydhn
had a problem deploying
to
docker-s3-upload
December 17, 2024 05:48 — with
GitHub Actions
Error
huydhn
had a problem deploying
to
docker-s3-upload
December 17, 2024 05:54 — with
GitHub Actions
Failure
huydhn
had a problem deploying
to
docker-s3-upload
December 17, 2024 05:55 — with
GitHub Actions
Error
huydhn
had a problem deploying
to
docker-s3-upload
December 17, 2024 05:55 — with
GitHub Actions
Error
huydhn
had a problem deploying
to
docker-s3-upload
December 17, 2024 06:15 — with
GitHub Actions
Failure
huydhn
had a problem deploying
to
docker-s3-upload
December 17, 2024 06:16 — with
GitHub Actions
Error
huydhn
had a problem deploying
to
docker-s3-upload
December 17, 2024 06:16 — with
GitHub Actions
Error
huydhn
had a problem deploying
to
docker-s3-upload
December 17, 2024 06:36 — with
GitHub Actions
Failure
huydhn
had a problem deploying
to
docker-s3-upload
December 17, 2024 06:37 — with
GitHub Actions
Error
huydhn
had a problem deploying
to
docker-s3-upload
December 18, 2024 07:35 — with
GitHub Actions
Error
huydhn
had a problem deploying
to
docker-s3-upload
December 18, 2024 07:35 — with
GitHub Actions
Error
huydhn
temporarily deployed
to
docker-s3-upload
December 18, 2024 07:53 — with
GitHub Actions
Inactive
huydhn
had a problem deploying
to
docker-s3-upload
December 18, 2024 07:54 — with
GitHub Actions
Error
huydhn
had a problem deploying
to
docker-s3-upload
December 18, 2024 07:54 — with
GitHub Actions
Error
huydhn
temporarily deployed
to
docker-s3-upload
December 18, 2024 08:17 — with
GitHub Actions
Inactive
huydhn
temporarily deployed
to
docker-s3-upload
December 18, 2024 08:17 — with
GitHub Actions
Inactive
huydhn
temporarily deployed
to
docker-s3-upload
December 18, 2024 08:17 — with
GitHub Actions
Inactive
huydhn
had a problem deploying
to
docker-s3-upload
December 18, 2024 08:18 — with
GitHub Actions
Failure
huydhn
temporarily deployed
to
docker-s3-upload
December 18, 2024 08:18 — with
GitHub Actions
Inactive
huydhn
changed the title
Resurrect ao benchmark
Resurrect ao benchmark on Dev Infra AWS A100 runner
Dec 18, 2024
huydhn
changed the title
Resurrect ao benchmark on Dev Infra AWS A100 runner
Resurrect ao benchmark on AWS A100 runner
Dec 18, 2024
@huydhn has imported this pull request. If you are a Meta employee, you can view this diff on Phabricator. |
xuzhao9
approved these changes
Dec 18, 2024
huydhn
had a problem deploying
to
docker-s3-upload
December 18, 2024 23:49 — with
GitHub Actions
Failure
huydhn
had a problem deploying
to
docker-s3-upload
December 18, 2024 23:49 — with
GitHub Actions
Failure
@huydhn has imported this pull request. If you are a Meta employee, you can view this diff on Phabricator. |
huydhn
added a commit
to pytorch/test-infra
that referenced
this pull request
Dec 27, 2024
After pytorch/benchmark#2561, TorchBench AO benchmark data is now available to query and we can finally use that dashboard again. If this proves useful, the next steps would be: 1. pytorch/benchmark#2561 only brings back one example model for each suite (TorchBench, HF, TIMM). We need to add more. 2. TorchBench AO dashboard shares the code with TorchInductor dashboard. While the former has been migrated to the new benchmark database, the latter hasn't. I will need to do that and clean this up in a later PR. 3. Looking at the results on the dashboard, it seems that `autoquant` works, but not `int8dynamic` and `int8weightonly`. I'm not sure if they are still relevant, but if they are, ao team should know how to fix them (cc @jerryzh168). The run on TorchBench is at https://github.com/pytorch/benchmark/actions/workflows/torchao.yml ### Testing The two metrics speedup and abs execution time are now showing up https://torchci-git-fork-huydhn-ch-migrate-torchao-queries-fbopensource.vercel.app/benchmark/torchao?dashboard=torchao&startTime=Sun%2C%2015%20Dec%202024%2011%3A06%3A45%20GMT&stopTime=Sun%2C%2022%20Dec%202024%2011%3A06%3A45%20GMT&granularity=hour&mode=inference&dtype=autoquant&deviceName=cuda%20(a100)&lBranch=main&lCommit=07e6ef43fca2e95bc6cf59f97ba6251e618ef0e3&rBranch=main&rCommit=c03fa7c6c1bd03242a9de1fddb77a9c778106afd
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
I'm bringing back some example models first, one for each set (TIMM, HF, TorchBench), to have some data to unblock our TorchAO ClickHouse migration. More can be added later if we decide to keep this workflow.
Testing
https://github.com/pytorch/benchmark/actions/runs/12388956432/job/34581035274
The results are now available on
oss_ci_benchmark_v3
tableselect * from oss_ci_benchmark_v3 where workflow_id = 12388956432