Use pytorch binary for macos-arm64 workflow #1215

sjain-stanford · 2022-08-12T12:09:49Z

We're seeing flakiness of the macos-arm64 build with "pytorch source" enabled (passed on original PR but failed twice on main since, introducing increased overhead in landing PRs).

I'm leaning towards flipping this back to binary. This should help with the following:

reduce the number of long running pytorch source builds to one job (ubuntu-OOT)
reduce cache invalidations for macos builds to only llvm updates (which are weekly rather than hourly)
speed-up fail-fast signal despite being decoupled from ubuntu matrix jobs

My earlier[ PR](#1213) had (among other things) decoupled ubuntu and macos builds into separate matrix runs. This is not working well due to limited number of MacOS GHA VMs causing long queue times and backlog. There are two reasons causing this backlog: 1. macos arm64 builds with pytorch source are getting erratically cancelled due to resource / network constraints. This is addressed with this: #1215 > "macos-arm64 (in-tree, OFF) The hosted runner: GitHub Actions 3 lost communication with the server. Anything in your workflow that terminates the runner process, starves it for CPU/Memory, or blocks its network access can cause this error." 2. macos runs don't fail-fast when ubuntu runs fail due to being in separate matrix setups. This PR couples them again.

* Add git commit id to llvm.ident metadata * Addressing review comment: llvm#1211 (comment) Signed-off-by: Whitney Tsang <[email protected]> Co-authored-by: Whitney Tsang <[email protected]> Co-authored-by: Ettore Tiotto <[email protected]>

sjain-stanford requested review from powderluv and silvasean August 12, 2022 12:09

use pytorch binary for macos-arm64 builds

d0220bf

sjain-stanford force-pushed the sambhav/pytorch_binary_macos branch from 3cd4e1f to d0220bf Compare August 12, 2022 12:29

powderluv approved these changes Aug 12, 2022

View reviewed changes

powderluv merged commit b8bd0a4 into llvm:main Aug 12, 2022

sjain-stanford mentioned this pull request Aug 12, 2022

Merge matrix runs to fail fast globally #1216

Merged

tanyokwok mentioned this pull request Sep 21, 2022

features/bladedisc rebase 20220830 pai-disc/torch-mlir#20

Closed

sjain-stanford deleted the sambhav/pytorch_binary_macos branch November 10, 2022 19:53

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Use pytorch binary for macos-arm64 workflow #1215

Use pytorch binary for macos-arm64 workflow #1215

sjain-stanford commented Aug 12, 2022 •

edited

Loading

Use pytorch binary for macos-arm64 workflow #1215

Use pytorch binary for macos-arm64 workflow #1215

Conversation

sjain-stanford commented Aug 12, 2022 • edited Loading

sjain-stanford commented Aug 12, 2022 •

edited

Loading