-
Notifications
You must be signed in to change notification settings - Fork 1.2k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Infinite looping in a build with a lot of cache imports #1984
Comments
Are there logs? Can't see from the link. Can you repro locally or is it related to your cache state? Only similar issues I've seen are related to |
This one seems to have stored the logs better: https://github.com/robtaylor/openlane/runs/1914403173?check_suite_focus=true (the loop triggers a github workflow issue when the worker runs out of disk space) Raw longs here: https://pipelines.actions.githubusercontent.com/6HcEdXX4QosuWhCiyfFIsk7uCtk1AKLuOCm1NIJ1Yrn64POax9/_apis/pipelines/1/runs/54/signedlogcontent/465?urlExpires=2021-02-17T22%3A38%3A02.7320963Z&urlSigningMethod=HMACV1&urlSignature=M6DSoXBN1qeWhLLzsAkL3IEGQVZX3N2etuxQgUzfxWA%3D The cache is clean at the start of the build. Cache is fetched from inline caches on multiple artefacts on docker hub. Its happing locally as well. Locally it tends to eventually fail with a cache item size mismatch error (i'm presuming due to running out of space on the builder docker instance) I've gone over all the COPY --from lines, they all have unique destination folders. |
I can't reproduce because the images where you import cache are private and running the build skips them for me. So I can't see what is weird in one of these images. If you can't get me access to the images maybe you can give me steps how to generate cache into a local registry so that it hits this issue on import. |
My hunch is that because you import so many cache sources, somewhere there is a loop. Eg. file is copied A into B, but based on another cache source builder determines that B is the same as A or maybe its parent and then gets confused. |
I'm a little confused: all the images on the shapebuild dockerhub account are public. To reproduce locally use |
I've looked for loops, but I can't spot any. Could it be possible that some of the build stages are generating identical files that cause the clash? |
I didn't know this was the |
I just realised that... :)
…On Thu, 18 Feb 2021 at 07:33, Tõnis Tiigi ***@***.***> wrote:
I didn't know this was the CACHE_ID. It's masked in the logs
https://github.com/robtaylor/openlane/runs/1914403173?check_suite_focus=true#step:8:14
—
You are receiving this because you authored the thread.
Reply to this email directly, view it on GitHub
<#1984 (comment)>, or
unsubscribe
<https://github.com/notifications/unsubscribe-auth/AAB4O4FQSXOM4LYT7KESM3DS7S7GLANCNFSM4XYMUMGQ>
.
|
I can reproduce now. Will look again tomorrow. If you need to update these cache source images please let me know first and I'll make a backup of them for my testing. |
So it looks like there isn't actually any loop, it just looks like it is because of the massive amount of data. 2 issues are in play here. First is an issue with limiting concurrency when pulling the layers. It does not work properly with so many separate images being pulled in parallel. This means too many tcp connections are created causing things to break down (I got an error from cloudflare). The second issue is excessive progress rows in I pushed an image with the concurrency fix With that image I didn't have issues with pulling the cache. It took about 5min though with almost 400 requests. I didn't wait until the qemu builds finished but everything up to that point seemed normal. https://gist.github.com/tonistiigi/66f16d8daf6750d29ce8f51ac9a228c2 |
That's amazing work, thank you Toni! Out of interest, why is it making 400 requests? |
There are Did you have a chance to validate |
@tonistiigi For some reason, it's now decided to rebuild all the dependencies. Could that be an effect of this patch? interestingly, it worked fine locally a couple of times, but is now behaving the same as the above build. |
@tonistiigi @AkihiroSuda this isn't fixed for me... why is it closed? |
Thanks @AkihiroSuda ! |
any more thoughts on this? |
I created another issue with infinite loop with cached layers #2009, doesn't looks like duplicate after reading comments here. |
I have a public build here that's looping indefinitely. The build is using latest stable.
https://github.com/robtaylor/openlane/runs/1910363514?check_suite_focus=true
The text was updated successfully, but these errors were encountered: