Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

CI Jobs are timing out in the restructured-pipeline #10293

Closed
DonJayamanne opened this issue Feb 24, 2020 · 7 comments
Closed

CI Jobs are timing out in the restructured-pipeline #10293

DonJayamanne opened this issue Feb 24, 2020 · 7 comments
Assignees
Labels
bug Issue identified by VS Code Team member as probable bug

Comments

@DonJayamanne
Copy link

DonJayamanne commented Feb 24, 2020

With the new changes we're running functional tests and single root worksapce tests in the same job.
Unfortunately both are slow on windows.

  • Functional tests >10 minutes
  • Single root workspace tests ~= 50 minutes
  • Job has a timeout of 60 minutes

Hence the jobs are cancelled before completion.

We'd need to split them into two separate jobs or increase the timeout.
Increasing timeout only causes the entire pipeline to run slower.
I.e. two separate jobs results in parallelized jobs = faster.

Bug as this now causes jobs to fail cancel out, which didn't in the past.
Also, if there are failures in the tests, these won't be reported anymore, as the whole job has been cancelled

@DonJayamanne DonJayamanne added bug Issue identified by VS Code Team member as probable bug triage-needed Needs assignment to the proper sub-team labels Feb 24, 2020
@DonJayamanne DonJayamanne changed the title CI Jobs are timing out CI Jobs are timing out in the restructured-pipeline Feb 24, 2020
@ghost ghost removed the triage-needed Needs assignment to the proper sub-team label Feb 24, 2020
@karthiknadig
Copy link
Member

May be we should also increase the Max Parallel. Currently it is 2, and it should be more.

@DonJayamanne
Copy link
Author

May be we should also increase the Max Parallel. Currently it is 2, and it should be more.

Sure, i'll create a separate PR, or we need a new issue for that fix.

@DonJayamanne
Copy link
Author

Oh sorry, @karthiknadig that's fine, I though this was my PR, hence asked for a separate issue for the change.

@ericsnowcurrently
Copy link
Member

For the PR pipeline it would be better if we could increase the timeout instead of splitting the job up. The job is not a required one so folks don't need to wait for it if the PR changes don't necessitate that level of validation. If we split the job then that is 1 extra build agent we are using every time a PR gets created or updated.

For the other pipelines (not sure they are affected), splitting it would be fine.

@ericsnowcurrently
Copy link
Member

May be we should also increase the Max Parallel. Currently it is 2, and it should be more.

Keep in mind that increasing the max parallel setting will mean we are using more agents, which is what we were trying to improve.

Also, increasing the max parallel value shouldn't impact timeouts, so I don't see the value in doing so. If folks are running the CI (or other) pipeline manually and that is taking too long then we should have a separate manual-only pipeline with a higher max parallel value. In fact, I already added a build/ci/vscode-python-ci-manual.yaml and it only needs to be set up on AzDO.

@ericsnowcurrently
Copy link
Member

As I mentioned elsewhere, I'm sure that splitting the job back into 2 would be fine.

FWIW, my only concern is that the repo has any PR-gating CI jobs that take more than a few minutes to run. At some point we need to have our unit test coverage to the level that we can confidently rely on it almost exclusively for gating PRs (with perhaps a handful of fast functional tests to cover OS-specific behaviors). However, that's a discussion for a different forum. 😄

@DonJayamanne
Copy link
Author

Validated

@ghost ghost removed the needs PR label Mar 5, 2020
@lock lock bot locked as resolved and limited conversation to collaborators Mar 17, 2020
Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.
Labels
bug Issue identified by VS Code Team member as probable bug
Projects
None yet
Development

No branches or pull requests

4 participants