Migrate disk image building to systemd-repart #1276

DaanDeMeyer · 2022-11-22T09:32:26Z

Let's switch to systemd-repart to build disk images. systemd-repart
allows building disk images declaratively. repart will assemble a
disk image according to a set of partition definition files, which
it will search for in a set of predefined locations or in the locations
provided via the --definitions option. Partition definition files are
defined in the usual systemd ini style and define a single partition
each. Many aspects of the partition can be configured, including its
size, type, and how it should be populated.

Of note should be the CopyFiles= setting that allows providing a set
of files/directories from which the partition should be populated. The
CopyFiles= setting takes a list of path pairs, where the first path in
a pair defines the source location, and the second path the destination
location in the partition. If combined with repart's --root or --image
options, the source path is resolved relative to the given root directory.

Along with switching to systemd-repart, we also rework how we build
images. Previously, we first provisioned the disk image with all its
partitions, then mounted the disk image and finally populated it with
contents. Now, we provision to a regular directory on the host filesystem
first, followed by creating the disk image with systemd-repart. The partition
definition files can be supplied via the --repart-directory option or
in the mkosi.repart/ directory. If not provided, a default configuration
consisting of an automatically sized root partition and an ESP partition of
256M is used.

repart's --root switch is used so that the partition definition files
don't need to encode the full path to the root directory of the image
we're building. For example, to copy the root directory to a root
partition, it's sufficient to specify "CopyFiles=/:/" in the definition
file.

If we're building non-bootable images, any ESP partition definitions
are excluded. To allow building UKIs with an embedded roothash cmdline
parameter, we run repart twice, first to populate all partitions except
the ESP/XBOOTLDR partitions, and again to populate the ESP/XBOOTLDR
partitions including any UKIs with embedded roothash cmdline parameter
(which we determined when running repart the first time).

Because we don't know up-front anymore where the ESP partition will be
mounted, all boot loader files are installed to /boot. So to populate
an ESP partition, you'd use "CopyFiles=/boot:/" in the partition
definition file of the ESP partition.

Also, since we don't know up-front anymore which filesystem we'll be
building for, we stop installing filesystem related packages by default.
Users will be required to add the necessary filesystem packages themselves.
Similarly, we also stop installing cryptsetup and device-mapper by default
since we don't know upfront anymore whether partitions will be verity
protected or encrypted.

By switching to systemd-repart, we also set ourselves up for building
images without needing root privileges or loop devices. systemd-repart
is fully capable (from v253 onwards) of building disk images without
needing root privileges or loop devices. After we switch to
systemd-repart, we'll only need root privileges to be able to run
systemd-nspawn (which should not be necessary in the future anymore).

This PR also removes all the option related to disk image building that
can now be specified in repart definition files instead. This includes:

Format=gpt_xxx options are replaced with a single "disk" options.
Filesystem can now be specified with repart's Format= option
Format=plain_squashfs (Can be reproduced by a single repart squashfs
root partition combined with SplitArtifacts=yes)
Verity= (Replaced by repart's Verity= options)
Encrypt= (Replaced by repart's Encrypt= option)
RootSize=, HomeSize=, VarSize=, TmpSize=, ESPSize=, SwapSize=, SrvSize=
(Replaced by repart's size options)
UsrOnly= (replaced with CopyFiles=/:/usr in a usr partition definition)
OutputSplitRoot=, OutputSplitVerity=, (Replaced by repart's SplitName= option)
OutputSplitKernel= (UKI is now always written to its own output file)
GPTFirstLBA (Removed, no equivalent in repart)
ReadOnly= (Replaced by repart's ReadOnly= option per partition)
Minimize= (Replaced by repart's Minimize= option per partition)
CompressFs= (No equivalent in repart, can be replicated by replacing mkfs.
in $PATH with a script that adds the necessary command line option)
CompressOutput= is renamed to Compress=
MkSquashfs= (Can be replaced with a script in $PATH that invokes
the correct binary)

We also remove the WithoutUnifiedKernelImages= switch as building unified
kernel images is trivial and fast these days.

action.yaml

DaanDeMeyer · 2022-11-22T15:45:10Z

This also gets rid of the pytest integration tests we added a while ago. Given that we'll be moving to integrating more with build systems in the future, building and booting the image as part of the same integration test seems like the wrong way to go. Instead we should just build the images in one step in a build system such as meson and then have the integration test(s) depend on it. For now, I switched us to the same approach used in systemd which does the trick as well.

DaanDeMeyer · 2022-11-25T12:49:53Z

I also switched us from using temporary files and passing around Optional[BinaryIO] everywhere to a staging directory that all functions that produce output write into and look into for dependent files. This was necessary because systemd-repart produces one or more output files which we can't account for with the current model and because it simplifies everything quite a bit as well.

Adding this option was a mistake (mea culpa), we should limit host impact on image builds as much as possible so let's drop this option that makes the initrd generated by dracut host specific. This will slow down initrd generation but once we switch to generating initrds with mkosi we'll have a proper fix for that.

keszybz · 2022-12-20T14:20:38Z

Format=plain_squashfs (Can be reproduced by a single repart squashfs
root partition combined with SplitArtifacts=yes)

I find this problematic. I expect that it'll make building of initrds more expensive. But I think it's not a blocker… In the worst case we can always add some support to build such images directly again.

CompressFs= (No equivalent in repart, can be replicated by replacing mkfs.
in $PATH with a script that adds the necessary command line option)

I don't think the approach of requiring the user to create fake wrappers to pass options is acceptable in the long run. But it shouldn't be too hard to add support for passing compression options again.

CompressOutput= is renamed to Compress=

Can we please not do that. There's bound to be many different compression settings. No need to recreate the old issue that was once solved.

keszybz

I think we should merge this asap.

.github/mkosi.extra/usr/lib/systemd/system/mkosi-check-and-shutdown.service

.github/workflows/ci.yml

mkosi/__init__.py

DaanDeMeyer · 2022-12-21T11:13:36Z

I don't think the approach of requiring the user to create fake wrappers to pass options is acceptable in the long run. But it shouldn't be too hard to add support for passing compression options again.

It's not a mkosi problem anymore after we merge this. It'll be a repart problem. If we want to do this slightly nicer we'll need to have repart read extra mkfs options from environment variables like we do in homed and such.

I find this problematic. I expect that it'll make building of initrds more expensive. But I think it's not a blocker… In the worst case we can always add some support to build such images directly again.

cpio building hasn't changed, we only remove the plain squashfs output format because the complexity to keep it is bigger than then the added benefit of supporting it. cpio and tar output formats are still supported as before. (Unless you're saying that you want squashfs initrds?)

Can we please not do that. There's bound to be many different compression settings. No need to recreate the old issue that was once solved.

Reverted this, we drop CompressFs and keep CompressOutput and remove Compress.

DaanDeMeyer · 2022-12-21T11:22:25Z

CI failures are unrelated, shall we merge this and fix any remaining issues in follow up PRs?

behrmann

Sounds good to me, one last thing I noticed.

behrmann · 2022-12-21T11:25:45Z

.gitignore

@@ -12,7 +12,6 @@
 /dist
 /mkosi.build
 /mkosi.egg-info
-/mkosi.extra


With the files moved this isn't necessary anymore, no?

The current approach where we build the image as part of the integration test is the wrong approach. Instead, we'll move to integration tests that are integrated with a build system where building images and using them in integration tests are separate steps. Building an image for use in an integration test will be a regular target (custom_target()) in a build system. Building an image that's not intended to be used in any other test will be a regular test. Because this model is going to be substantially different from what we have now, let's get rid of the integration test machinery we added and temporarily switch to the basic approach that's also used in the systemd repo for integration tests.

It already was non-optional in practice, let's encode this in the type.

Let's switch to systemd-repart to build disk images. systemd-repart allows building disk images declaratively. repart will assemble a disk image according to a set of partition definition files, which it will search for in a set of predefined locations or in the locations provided via the --definitions option. Partition definition files are defined in the usual systemd ini style and define a single partition each. Many aspects of the partition can be configured, including its size, type, and how it should be populated. Of note should be the CopyFiles= setting that allows providing a set of files/directories from which the partition should be populated. The CopyFiles= setting takes a list of path pairs, where the first path in a pair defines the source location, and the second path the destination location in the partition. If combined with repart's --root or --image options, the source path is resolved relative to the given root directory. Along with switching to systemd-repart, we also rework how we build images. Previously, we first provisioned the disk image with all its partitions, then mounted the disk image and finally populated it with contents. Now, we provision to a regular directory on the host filesystem first, followed by creating the disk image with systemd-repart. The partition definition files can be supplied via the --repart-directory option or in the mkosi.repart/ directory. If not provided, a default configuration consisting of an automatically sized root partition and an ESP partition of 256M is used. repart's --root switch is used so that the partition definition files don't need to encode the full path to the root directory of the image we're building. For example, to copy the root directory to a root partition, it's sufficient to specify "CopyFiles=/:/" in the definition file. If we're building non-bootable images, any ESP partition definitions are excluded. To allow building UKIs with an embedded roothash cmdline parameter, we run repart twice, first to populate all partitions except the ESP/XBOOTLDR partitions, and again to populate the ESP/XBOOTLDR partitions including any UKIs with embedded roothash cmdline parameter (which we determined when running repart the first time). Because we don't know up-front anymore where the ESP partition will be mounted, all boot loader files are installed to /boot. So to populate an ESP partition, you'd use "CopyFiles=/boot:/" in the partition definition file of the ESP partition. Also, since we don't know up-front anymore which filesystem we'll be building for, we stop installing filesystem related packages by default. Users will be required to add the necessary filesystem packages themselves. Similarly, we also stop installing cryptsetup and device-mapper by default since we don't know upfront anymore whether partitions will be verity protected or encrypted. By switching to systemd-repart, we also set ourselves up for building images without needing root privileges or loop devices. systemd-repart is fully capable (from v253 onwards) of building disk images without needing root privileges or loop devices. After we switch to systemd-repart, we'll only need root privileges to be able to run systemd-nspawn (which should not be necessary in the future anymore). This PR also removes all the option related to disk image building that can now be specified in repart definition files instead. This includes: - Format=gpt_xxx options are replaced with a single "disk" options. Filesystem can now be specified with repart's Format= option - Format=plain_squashfs (Can be reproduced by a single repart squashfs root partition combined with SplitArtifacts=yes) - Verity= (Replaced by repart's Verity= options) - Encrypt= (Replaced by repart's Encrypt= option) - RootSize=, HomeSize=, VarSize=, TmpSize=, ESPSize=, SwapSize=, SrvSize= (Replaced by repart's size options) - UsrOnly= (replaced with `CopyFiles=/:/usr` in a usr partition definition) - OutputSplitRoot=, OutputSplitVerity=, (Replaced by repart's SplitName= option) - OutputSplitKernel= (UKI is now always written to its own output file) - GPTFirstLBA (Removed, no equivalent in repart) - ReadOnly= (Replaced by repart's ReadOnly= option per partition) - Minimize= (Replaced by repart's Minimize= option per partition) - CompressFs= (No equivalent in repart, can be replicated by replacing mkfs.<fs> in $PATH with a script that adds the necessary command line option) - MkSquashfs= (Can be replaced with a script in $PATH that invokes the correct binary) We also remove the WithoutUnifiedKernelImages= switch as building unified kernel images is trivial and fast these days.

DaanDeMeyer · 2022-12-21T13:44:43Z

Only changed .gitignore, let's merge

DaanDeMeyer force-pushed the repart branch 10 times, most recently from cf2cfba to 88eba28 Compare November 22, 2022 13:45

bluca reviewed Nov 22, 2022

View reviewed changes

action.yaml Outdated Show resolved Hide resolved

DaanDeMeyer force-pushed the repart branch from 88eba28 to b7e47ee Compare November 22, 2022 14:11

DaanDeMeyer force-pushed the repart branch 11 times, most recently from 159f10f to d65ac6d Compare November 25, 2022 12:46

DaanDeMeyer force-pushed the repart branch 5 times, most recently from 5c181b1 to 694faa6 Compare November 25, 2022 20:32

DaanDeMeyer force-pushed the repart branch 2 times, most recently from 6a17947 to c1c679c Compare December 15, 2022 14:25

DaanDeMeyer added 2 commits December 15, 2022 14:45

Bump alma/rocky default release to "9"

fee9cdf

DaanDeMeyer force-pushed the repart branch 3 times, most recently from dd9301d to e00b185 Compare December 15, 2022 14:57

This was referenced Dec 19, 2022

gpt-auto-root fails (A start job is running for /dev/gpt-auto-root) #1283

Closed

Remove annotations import from __future__ #1280

Merged

keszybz requested changes Dec 20, 2022

View reviewed changes

DaanDeMeyer mentioned this pull request Dec 20, 2022

--secureboot is broken when --incremental is used #1288

Closed

DaanDeMeyer force-pushed the repart branch 4 times, most recently from afabc04 to 5901cbb Compare December 21, 2022 11:06

behrmann approved these changes Dec 21, 2022

View reviewed changes

DaanDeMeyer added 4 commits December 21, 2022 14:44

Make config.build_sources non-optional

5db651b

It already was non-optional in practice, let's encode this in the type.

mypy: Fix version comparison

7444c2e

DaanDeMeyer force-pushed the repart branch from 5901cbb to 8bbbd83 Compare December 21, 2022 13:44

DaanDeMeyer merged commit 381a23f into systemd:main Dec 21, 2022

DaanDeMeyer deleted the repart branch December 21, 2022 13:44

This was referenced Dec 21, 2022

Fix failure using BaseImage with a filesystem image #1260

Closed

[gentoo] preper for usrmerge #1173

Merged

behrmann mentioned this pull request Dec 21, 2022

mkosi has switched to systemd-repart #1290

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Migrate disk image building to systemd-repart #1276

Migrate disk image building to systemd-repart #1276

DaanDeMeyer commented Nov 22, 2022 •

edited

Loading

DaanDeMeyer commented Nov 22, 2022

DaanDeMeyer commented Nov 25, 2022

keszybz commented Dec 20, 2022

keszybz left a comment

DaanDeMeyer commented Dec 21, 2022

DaanDeMeyer commented Dec 21, 2022 •

edited

Loading

behrmann left a comment

behrmann Dec 21, 2022

DaanDeMeyer commented Dec 21, 2022

Migrate disk image building to systemd-repart #1276

Migrate disk image building to systemd-repart #1276

Conversation

DaanDeMeyer commented Nov 22, 2022 • edited Loading

DaanDeMeyer commented Nov 22, 2022

DaanDeMeyer commented Nov 25, 2022

keszybz commented Dec 20, 2022

keszybz left a comment

Choose a reason for hiding this comment

DaanDeMeyer commented Dec 21, 2022

DaanDeMeyer commented Dec 21, 2022 • edited Loading

behrmann left a comment

Choose a reason for hiding this comment

behrmann Dec 21, 2022

Choose a reason for hiding this comment

DaanDeMeyer commented Dec 21, 2022

DaanDeMeyer commented Nov 22, 2022 •

edited

Loading

DaanDeMeyer commented Dec 21, 2022 •

edited

Loading