Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

fuse-overlayfs: enable shifting #195

Merged
merged 10 commits into from
Jul 27, 2018

Conversation

giuseppe
Copy link
Member

FUSE overlayfs has builtin support for UID/GID shifting so that we don't need to re-create a new image when using different mappings for the user namespaces.

When the FUSE backend is used, don't create a new image and chown it, but let the FUSE backend do the remapping on the fly.

Starting with an empty /var/lib/containers/storage:

# podman pull fedora
# for i in $(seq 10)
> do
>     bin/podman --storage-opt=overlay.fuse_program=/usr/local/bin/fuse-overlayfs run  \
       --uidmap=0:$i:10000 --gidmap=0:$i:10000 --rm fedora \
        stat -c "owner is: %U" /usr/bin/ls
>     du --si --summarize /var/lib/containers/storage/
>     echo
> done
owner is: root
286M	/var/lib/containers/storage/

owner is: root
286M	/var/lib/containers/storage/

owner is: root
286M	/var/lib/containers/storage/

owner is: root
286M	/var/lib/containers/storage/

owner is: root
286M	/var/lib/containers/storage/

owner is: root
286M	/var/lib/containers/storage/

owner is: root
286M	/var/lib/containers/storage/

owner is: root
286M	/var/lib/containers/storage/

owner is: root
286M	/var/lib/containers/storage/

owner is: root
286M	/var/lib/containers/storage/

/cc @rhatdan @nalind @rhvgoyal @AkihiroSuda

@giuseppe giuseppe force-pushed the fuse-overlayfs-shifting branch 6 times, most recently from 89ca5f1 to cec74cd Compare July 11, 2018 13:44
@rhatdan
Copy link
Member

rhatdan commented Jul 12, 2018

LGTM
@nalind PTAL

@giuseppe
Copy link
Member Author

I'll probably need to rebase this PR on top of #199

@giuseppe
Copy link
Member Author

let's merge the other one first, as it addresses some real issues :-)

@giuseppe giuseppe force-pushed the fuse-overlayfs-shifting branch from cec74cd to 45588a6 Compare July 17, 2018 07:07
@giuseppe
Copy link
Member Author

rebased ⬆️

@rhatdan
Copy link
Member

rhatdan commented Jul 17, 2018

Should supports_shifting be a flag in the storage.conf? Rather then force any mount driver to support shifting?

@giuseppe
Copy link
Member Author

I am personally in favor of making it configurable, but I understood @nalind would prefer to keep this simpler for now. @nalind what do you think?

@nalind
Copy link
Member

nalind commented Jul 17, 2018

I don't think that making assumptions based on whether or not the mount_program option is set is going to work if we need to keep adding variations on how overlay works.

@@ -1033,16 +1038,22 @@ func (s *store) imageTopLayerForMapping(image *Image, ristore ROImageStore, read
}
rc, err := layerHomeStore.Diff("", layer.ID, &diffOptions)
if err != nil {
return nil, errors.Wrapf(err, "error reading layer %q to create an ID-mapped version of it")
return nil, errors.Wrapf(err, "error reading layer %q to create an ID-mapped version of it", layer.ID)
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

👍

@nalind
Copy link
Member

nalind commented Jul 17, 2018

This needs tests.

How does a caller "know" which ID mappings to use when launching a container using a layer with this setting enabled as its rootfs? Previously, they'd consult the information recorded along with the layer, but we appear to be setting that information to indicate that host mappings should be used, while quietly using the global default mappings for the actual mounting.

What happens when we create a new layer that uses the new feature, using a parent layer that didn't, but which used a non-default set of ID mappings?

@giuseppe giuseppe force-pushed the fuse-overlayfs-shifting branch from 45588a6 to ce9d2bf Compare July 18, 2018 11:55
@giuseppe
Copy link
Member Author

How does a caller "know" which ID mappings to use when launching a container using a layer with this setting enabled as its rootfs? Previously, they'd consult the information recorded along with the layer, but we appear to be setting that information to indicate that host mappings should be used, while quietly using the global default mappings for the actual mounting.

I thought I could propagate this information from podman, for example when starting a container that was previously created. I tried to do it, but it doesn't look nice. Where is the best place to store this information? When we mount directly the layer we don't know what mappings were used when it was created

@nalind
Copy link
Member

nalind commented Jul 18, 2018

We keep track of the mappings that a layer is created for using its UIDMap and GIDMap fields. We use that information when creating layers to set permissions in the layer, possibly reverting mappings inherited from the parent layer as part of the permissions-changing process, and then make it available to higher-level tools so that they can, in turn, pass them on to runtime tools.
CreateContainer() passes mappings that it's asked to use down to PutLayer(), and then keeps a copy of what it ends up using in the container's IDMappingOptions for the convenience of callers that don't want to deal with individual layers.

@giuseppe
Copy link
Member Author

giuseppe commented Jul 18, 2018

should I store that we are using shifting or we should allow only one mode to be used (either shifting enabled or not)?

Also, how should I pass down the information about uid/gid mappings into the driver? We add two more arguments to Get(..)?

@giuseppe giuseppe force-pushed the fuse-overlayfs-shifting branch 6 times, most recently from 3a5c2c8 to 10b55c3 Compare July 18, 2018 20:12
@giuseppe
Copy link
Member Author

test added ⬆️

@giuseppe giuseppe force-pushed the fuse-overlayfs-shifting branch 3 times, most recently from 8343967 to 0acf8e9 Compare July 19, 2018 08:13
@giuseppe
Copy link
Member Author

is the new option blocking this PR?

@giuseppe
Copy link
Member Author

I'd like to get this moving, should I add an option for enabling shifting or is it fine as the current PR does to assume it is supported by the FUSE implementation?

@rhatdan
Copy link
Member

rhatdan commented Jul 25, 2018

I think we should just assume the mount_program does the shifting.

@@ -4,20 +4,6 @@ set -xe
source /etc/os-release

case "${ID_LIKE:-${ID:-unknown}}" in
debian)
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Why are you removing debian testing?

@@ -2,7 +2,7 @@

STORAGE_BINARY=${STORAGE_BINARY:-$(dirname ${BASH_SOURCE})/../containers-storage}
TESTSDIR=${TESTSDIR:-$(dirname ${BASH_SOURCE})}
STORAGE_DRIVER=${STORAGE_DRIVER:-vfs}
STORAGE_DRIVER=${STORAGE_DRIVER:-overlay}
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

@nalind is this something that will work everywhere?

@@ -2,7 +2,7 @@

STORAGE_BINARY=${STORAGE_BINARY:-$(dirname ${BASH_SOURCE})/../containers-storage}
TESTSDIR=${TESTSDIR:-$(dirname ${BASH_SOURCE})}
STORAGE_DRIVER=${STORAGE_DRIVER:-vfs}
STORAGE_DRIVER=${STORAGE_DRIVER:-overlay}
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I'm not okay with changing this testing default -- it trades running tests with the driver that always has to work for running them with another.

overlay*)
;;
*)
skip "not supported by driver $STORAGE_DRIVER"
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Indentation for these two lines is backwards.

@nalind
Copy link
Member

nalind commented Jul 25, 2018

If the mount program takes care of shifting, then we basically can never not use the option on a given layer once we've set up a layer that needs it, even if the configuration changes to no longer include using the mount program that provides the feature, because the layer essentially becomes "broken". We need to be keep track of this somehow to avoid creating a problem for callers when this happens.

giuseppe added 8 commits July 26, 2018 06:10
do not create a new image with different uid/gid if the driver has
support for shifting.

Signed-off-by: Giuseppe Scrivano <[email protected]>
Signed-off-by: Giuseppe Scrivano <[email protected]>
@giuseppe giuseppe force-pushed the fuse-overlayfs-shifting branch from 0acf8e9 to 79d12e0 Compare July 26, 2018 04:16
@giuseppe
Copy link
Member Author

@nalind the layer is stored without any mapping done. The translation happens only at runtime. For example, from a podman container with --uidmap 0:100000:100000:

# touch foobar foobar2
# chmod 998:999 foobar2

and from the host:

# find /var/lib/containers/storage/ -name 'foobar*' -exec stat --format="%n uid=%u gid=%g" \{\} \;
/var/lib/containers/storage/overlay/50119762b2a2366b0fa3ddaa79c7c506799a5e66d961cf53b59ac4bb78552afe/merged/foobar uid=100000 gid=100000
/var/lib/containers/storage/overlay/50119762b2a2366b0fa3ddaa79c7c506799a5e66d961cf53b59ac4bb78552afe/merged/foobar2 uid=100998 gid=100999
/var/lib/containers/storage/overlay/50119762b2a2366b0fa3ddaa79c7c506799a5e66d961cf53b59ac4bb78552afe/diff/foobar2 uid=998 gid=999
/var/lib/containers/storage/overlay/50119762b2a2366b0fa3ddaa79c7c506799a5e66d961cf53b59ac4bb78552afe/diff/foobar uid=0 gid=0

so the shifting has the opposite effect of running with a mapping enabled, are there cases when this is not enough?

@nalind
Copy link
Member

nalind commented Jul 27, 2018

We're changing what's stored in the Layer structure when we store a layer this way, in that it may no longer agree with the copy we store in the Container structure. That copy was originally added there as a convenience, to save callers from having to look up the mappings by looking up the Layer structure using the ID stored in the Container structure. I don't know how that'll impact current callers.
Leaving that aside, if the configuration is changed so that the mount helper is no longer used, we'll quietly discard the maps stored in the Container structure and mount the layer with no mappings, but if the caller is using the mapping information in the Container structure, things will break, so there needs to be an error check there.

@giuseppe
Copy link
Member Author

yes good point, we should error out when we try to mount the same layer/container and not using shifting. I've added a patch for doing it, now I've:

# podman --storage-opt overlay.mount_program=/usr/local/bin/fuse-overlayfs run --uidmap 0:100000:100000 --name test1 fedora echo hello
hello
# podman start -a test1
unable to start container b991456592f83b96b7675b54bdde84d0249686f1eec676261051f39f74b831f8: error mounting storage for container b991456592f83b96b7675b54bdde84d0249686f1eec676261051f39f74b831f8: cannot mount layer 3160cc1b237be611c580ddb2a61076491506caf1b8f9ff0080e5a2dbaa353d1a: shifting not enabled

createrandom "$lowermount"/file
storage unmount $layer

imagename=idmappedimage-shifting
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Mixing tabs and spaces makes this diff look weird.

@nalind
Copy link
Member

nalind commented Jul 27, 2018

LGTM

Signed-off-by: Giuseppe Scrivano <[email protected]>
@giuseppe giuseppe force-pushed the fuse-overlayfs-shifting branch 2 times, most recently from 1b4c904 to 4ff0bde Compare July 27, 2018 15:26
@giuseppe
Copy link
Member Author

@rhatdan tests are passing

@rhatdan rhatdan merged commit 956a197 into containers:master Jul 27, 2018
@runcom runcom mentioned this pull request Nov 9, 2018
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants