-
Notifications
You must be signed in to change notification settings - Fork 18.7k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
"Save our build times" aka Add support for mounts during docker build #31499
Conversation
c9fea15
to
66c8a12
Compare
ping @tonistiigi @cpuguy83 PTAL |
|
also, consideration for build cache is needed |
780daa0
to
0df7664
Compare
446f717
to
f872798
Compare
@thaJeztah @cpuguy83 @AkihiroSuda @tonistiigi this is now ready for design review please, before I go deep into covering test cases and code conventions, I'd like to make sure I'm on the right track 👍 This was the quickest path to opening up for feedback. Not sure that's how PRs to docker work, if it needs to be more feature complete before design review please let me know, I can do that as well. (btw - complete noob to docker codebase here) |
Thanks! I added this for the upcoming maintainers meeting, but perhaps @tonistiigi has time to look before that. He's working on some other improvements to the builder, so we should verify this addition doesn't "conflict" with the other enhancements |
Don't know if this is useful. But it seems these guys have already solved it https://github.com/grammarly/rocker. |
9de69a6
to
f4b7de5
Compare
f4b7de5
to
62a3984
Compare
Signed-off-by: Subhas Dandapani <[email protected]>
Signed-off-by: Subhas Dandapani <[email protected]>
62a3984
to
f29e2ef
Compare
Rocker could be a good option for build-time mounts. Have you had a chance to check it out @rdsubhas ? |
Hi @rdsubhas, As you have probably already read, this has been discussed many times previously. There are many reasons why I would feel hesitant about adding such a feature. It makes builds not self-contained, encourages creating broken dockerfiles, builds can't be tracked to immutable sources, invalidates caching, breaks remote usage of Docker(as bind mounts are not part of API) etc. Let's look at the actual use-cases behind this solutions: Performance, mounting is faster than copying context when there are lots of filesI'm working on a prototype of incremental contexts sends and can share more details soon. This means that repeated builds should not perform worse when the size of the context increases. Once this is in place and we have an immutable reference to the input data in daemon, we could think about mounting it to commands read-only if that makes more sense than There are many other cases how the build performance could be improved. If you are interested in helping out in some of them we should discuss more. In an earlier issue, there was also a comment about this being a workaround for smaller images. I've proposed #31257 for that. Need to share secret data to a running processThere is an open PR #30637. The benefit of that PR instead of this one is that it is clearly limiting the use case to secret data what makes it harder to have a completely broken Dockerfile or misuse it in another way. Secrets are clearly defined and we could combine them with a Dockerfile directive in the future so that required secrets could be defined in a similar way Another way to fix the secret problem would be to use an external service that provides trusted information (from swarm cluster, client, etc). Need to include directory outside of build contextWhile this is also encouraging broken Dockerfiles, sometimes, especially while developing and overriding other projects, it seems helpful to not require everything to be in the same folder. A solution for that has been discussed in #30101 (comment) . It is quite similar to this PR but "mounts" the data to context instead of the container. |
The main use case for something like this that sticks out to me (and was mentioned in #14080 but not explicitly here) is the case of Gentoo-based images. For Gentoo, the "packaging" files are stored in Relatedly, while building packages, the source of the packages being built gets downloaded to So in summary, the main use cases I see for "outside content" (based mostly on @tonistiigi's list above) are:
|
@tonistiigi I agree that making docker build mutable and inconsistent is not right but no one in their right might would do such a build for a production image. In our projects we have a ton of node_modules and each time someone in the team changes one of our private packages we need to do a clean npm install again in our development docker container and that takes almost 30 minutes and we also get rate limited by npm many times for doing too many updates in a day. I tried all sorts of ways to cache the node_modules and even tried yarn to save the packages offline so that they will be available in the build context but to no avail. Even tried rocker and that also didn't work. The only other way I see of solving this is running the npm install command in our existing container to update the packages and commit the container as the new latest image and that would solve our developer woes. So its okay if its not immutable since its development but can we have a command that does this or I need to write a python script to do this from now on. Docker has eased our devops work but substantially increased the developers work and we get pounded by the developers all the time because of this. |
Sorry for responding late, different timezones, @alexellis Yes indeed we have checked out rocker, but wherever possible we'd like to be compliant with normal upstream docker, especially since its moved to shorter release cycles. @tonistiigi I think @tianon's comment and others in #14080 have really collated a long list of issues. Some of us work in the corporate world where we don't really have the simplicity of doing short and sweet Maybe I don't have the right words to summarize this, but for us, this problem has grown beyond categorizing and into the "free-form-tagging" zone (most in #14080) What we see is simple: "a clean Atleast, docker with mounts gives us a more "predictable" stateless docker build process where we can improvise if a folder is empty (whether its bind-mounted or not), and we can have one Dockerfile that works idempotently whether a folder was mounted or not (i.e. trust developers with the last mile idempotency). |
@thaJeztah thanks! will close this... |
@rdsubhas thanks! I didn't want to close because no decision was made yet, but would definitely appreciate your input on those proposals 👍 |
What this does
--mount
s to be specified duringdocker build
docker build --mount type=bind,source=/hostpath,target=/containerpath ...
Checklist
ls
the mount only while building, and does not exist in image metadata or when running--mount
instead of-v
/build
api, cli, etc--no-cache
) if mount definitions are changedmount
options are less documented in general, difference betweendocker run -v
anddocker build --mount
, general warnings and disclaimers, how to invalidate cache, etcADD/COPY
inside build context/workdir, etcSigned-off-by: Subhas Dandapani [email protected]
Obligatory picture of how it feels like in the beginning. Slowly warming up. Great developer docs 👍 It was really shocking how simple this was to get working though. All these years of orchestrating
docker build
as much as (or sometimes more than)docker run
for sake of performance and developer productivity feels like a lie. Hope we can move forward...