-
-
Notifications
You must be signed in to change notification settings - Fork 5.5k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
CI woes #27260
Comments
On Appveyor, it is roughly 35 minutes to build the sysimg, and the rest of the time is to run tests. Turning off the debug build might shave off 10 mins. We could probably run a smaller testsuite. Sharding across multiple appveyor jobs will probably be more complex and overall slow the queue further. |
For the record, FreeBSD CI ran into some issues like #23143 and randomly freezing in kernel stress testing. |
Do we really start 32 workers on appveyor in tests? Might it be oversubscribing things, or is that by design? |
Travis change happened suddenly and in response to no corresponding change on our end. Did they maybe water down their VMs again? It's happened several times before with a similar effect every time. |
Not at once. When a worker takes too much RSS memory it exit and starts a new worker. 32 is just the total number of workers that got started during all the tests. |
@iblis17 I am just curious what it takes to reproduce your setup on a linux box? Also, perhaps on a windows box? |
I merged #27257. Hopefully that should get circle back in business. |
I think we can turn off the debug builds. Or, we could just build libjulia-debug to make sure the debug build works, but not build the system image in debug mode (since it takes a while and is not really different from the release build). |
Trying to do the simplest thing in #27263 for Appveyor. This will disable the full debug build. Let's see if it helps. |
Perhaps they are throttling us on travis so that we migrate to the new thing? https://blog.travis-ci.com/2018-05-02-open-source-projects-on-travis-ci-com-with-github-apps |
@ViralBShah It's just a normal BuildBot setup. I don't think my setup is different from https://build.julialang.org/. I only spent effort on daily maintenance: first, check zombie/frozen processes and killed them manually (to releasing memory). Not sure why there are some processes cannot be killed by BuildBot. |
At least for Linux, setting up a BuildBot would be fairly trivial (and we likely have the capacity). Mac is probably more challenging. |
Appveyor is now on increased capacity of 10 concurrent workers, with time allocation of 3 hours. |
Great, only Travis left to figure out then! |
They wrote back saying that they can only help us early next week (which may or may not even be Monday). |
Also, the debug build is 20 minutes. Is that really worthwhile to build? |
Looking at the history of AV, the last successful run was on 23da960, 6 days ago. It took about 2 hours. Many builds were canceled just after that to have CI time for the alpha release. But every build that ran and didn't fail for other reasons timed out after 3 hours. What's happened there? |
I think that is #27274 |
Especially #27274 (comment), yes. Thanks for the pointer. |
CI is in a sad state which means a lot of CI gets rerun which increases queues and it is hard to make solid releases with flaky CI. This is an issue to collect the different CI problems:
Travis
Building sysimg hangs
Example logs: https://travis-ci.org/JuliaLang/julia/jobs/383489913, https://travis-ci.org/JuliaLang/julia/jobs/383452631
Hangs in other places:
Might be the same issue as the previous one. It is odd that it freezes in the middle of writing a word...
Example log: https://travis-ci.org/JuliaLang/julia/jobs/383458103
AppVeyor
Hitting 2hr time limits
Happens frequently.
Top 15 test groups in terms of time spent (seconds):
Time to build sysimg:
Non-debug:
Debug:
Example log: https://ci.appveyor.com/project/JuliaLang/julia/build/1.0.27099/job/fevqdpy21ka8btux
CircleCIFails getting gfortranExample log: https://circleci.com/gh/JuliaLang/julia/25927?utm_campaign=vcs-integration-link&utm_medium=referral&utm_source=github-build-linkPerhaps fixed by #27257.FreeBSD
Seems fairly solid
The text was updated successfully, but these errors were encountered: