cmake: compute and display the reproducible checksum by default #51954

marc-hb · 2022-11-04T01:49:30Z

Two commits, second commit message copied below.

tl;dr

This PR makes a huge difference for people who care about security and reproducible builds more generally.
This PR makes no observable difference for people who don't care,
The reproducible zephyr.strip is only a means to an end. Alternative ways to achieve the same purpose are welcome... as long as they don't require hundreds times more lines of code. This PR adds 2 lines.

If 0.5% additional build/ disk space is considered too much then one alternative is to delete zephyr.strip immediately after checksumming it. However the easiest way to automate #50205 is to compare zephyr.strip files.

An earlier and incomplete version of this was previously (and negatively) reviewed by a few people in

Kconfig: enable CONFIG_BUILD_OUTPUT_STRIPPED by default #51737

Temporary bugs, corner cases and obsolete toolchains aside, the Zephyr
build is reproducible most of the time: #50205 and #14593

This means two different build machines using the same toolchain will
always produce the same binary output. These 2 lines make
it trivial to verify that binary outputs are indeed the same by adding this
single line in the build logs:

[16/16] Linking C executable zephyr/zephyr.elf
Memory region         Used Size  Region Size  %age Used
             RAM:       53280 B         3 MB      1.69%
        IDT_LIST:          0 GB         2 KB      0.00%
fdd2ddf2ad7d5da5bbd79b41cef8d7b161a896b983cdaef549a8281111d8e205  zephyr.strip

The second commit enables that feature by default because build
reproducibility matters for (at least) two important reasons:

Security / supply chain attacks, see https://www.cisa.gov/sbom, Verify builds are reproducible in the CI #50205,
https://reproducible-builds.org/ and many others.
Making sure build configurations are strictly identical when trying to
reproduce elusive issues or when issuing releases.

It was of course already possible to manually make this Kconfig change
and manually compute this checksum. However this can be impossible when
dealing with an automated build system that does not archive all
intermediate (#5009) files like zephyr.elf. Tweaking the build
configuration can also be difficult and error-prone for people who are
not Zephyr developers.

Most automated CI systems preserve build logs by default.

Displaying the reproducible checksum by default accelerates the
discovery of reproducibility bugs like #48195.

When measured with west build -p -b qemu_x86 samples/hello_world/, the
additional build/zephyr/zephyr.strip disk space required is 43
kilobytes compared to a total of 11 Megabytes. Measuring a more
realistic SOF example, zephyr.strip weighed 690 kb which was about
0.1% of a total build/ directory weighing 65M.

To measure the build time cost I ran "west build -p -b qemu_x86
samples/hello_world/" many times in a loop with and without this PR on
my Linux workstation. Stripping and checksumming made literally no time
difference compared to the "noise" observed when building the same
configuration. This is not surprising considering how small
zephyr.strip is: so the extra cost is most likely dominated by process
creation and the total number of processes created during a Zephyr build
dwarfs the few extra processes required by this feature.

More surprisingly, I measured incremental builds by running
touch kernel/timer.c; west build ... in a loop and I could not
observe any visible time difference either.

Temporary bugs, corner cases and obsolete toolchains aside, the Zephyr build is most of the time reproducible: zephyrproject-rtos#50205 and zephyrproject-rtos#14593. This means two different build machines using the same toolchain will always produce the same binary output. The one-line addition in this commit makes it trivial to verify that binary outputs are indeed the same by adding a single checksum line in the build logs: ``` [16/16] Linking C executable zephyr/zephyr.elf Memory region Used Size Region Size %age Used RAM: 53280 B 3 MB 1.69% IDT_LIST: 0 GB 2 KB 0.00% fdd2ddf2ad7d5da5bbd79b41cef...7b16ef549a8281111d8e205 zephyr.strip ``` This commit makes a non-measurable build time difference. Build reproducibility matters for (at least) two important reasons: - Security / supply chain attacks, see https://www.cisa.gov/sbom, zephyrproject-rtos#50205, https://reproducible-builds.org/ and many others. - Making sure build configurations are strictly identical when trying to reproduce elusive issues or when issuing releases. Displaying a reproducible checksum accelerates the investigation of temporary reproducibility issues like zephyrproject-rtos#48195. Signed-off-by: Marc Herbert <[email protected]>

Temporary bugs, corner cases and obsolete toolchains aside, the Zephyr build is reproducible most of the time: zephyrproject-rtos#50205 and zephyrproject-rtos#14593 This means two different build machines using the same toolchain will always produce the same binary output. The previous, one-line commit made it trivial to verify that binary outputs are indeed the same by adding this single line in the buid logs: ``` [16/16] Linking C executable zephyr/zephyr.elf Memory region Used Size Region Size %age Used RAM: 53280 B 3 MB 1.69% IDT_LIST: 0 GB 2 KB 0.00% fdd2ddf2ad7d5da5bbd79b41cef8d7...1a896b989a8281111d8e205 zephyr.strip ``` This commit enables that feature by default because build reproducibility matters for (at least) two important reasons: - Security / supply chain attacks, see https://www.cisa.gov/sbom, zephyrproject-rtos#50205, https://reproducible-builds.org/ and many others. - Making sure build configurations are strictly identical when trying to reproduce elusive issues or when issuing releases. It was of course already possible to _manually_ make this Kconfig change and manually compute this checksum. However this can be impossible when dealing with an automated build system that does not archive all _intermediate_ (zephyrproject-rtos#5009) files like `zephyr.elf`. Tweaking the build configuration can also be difficult and error-prone for people who are not Zephyr developers. Most automated CI systems preserve build logs by default. Displaying the reproducible checksum by default accelerates the discovery of reproducibility bugs like zephyrproject-rtos#48195. When measured with `west build -p -b qemu_x86 samples/hello_world/`, the additional `build/zephyr/zephyr.strip` disk space required is 43 kilobytes compared to a total of 11 Megabytes. Measuring a more realistic SOF example, `zephyr.strip` weighed 690 kb which was about 0.1% of a total `build/` directory weighing 65M. To measure the build time cost I ran `west build -p -b qemu_x86 samples/hello_world/` many times in a loop with and without this PR on my Linux workstation. Stripping and checksumming made literally no time difference compared to the "noise" observed when building the same configuration. This is not surprising considering how small `zephyr.strip`: so the extra cost is most likely dominated by process creation and the total number of processes created during a Zephyr build dwarfs the few extra processes required by this feature. More surprisingly, I measured incremental builds by running `touch kernel/timer.c; west build ...` in a loop and I could not observe any visible time difference either. Signed-off-by: Marc Herbert <[email protected]>

johnandersen777 · 2022-11-04T05:17:24Z

Woohoo! 🥳 Having this output within the build logs makes it easy for for various supply chain security activities to run cross-environment verification. (related)

dbaluta · 2022-11-04T09:58:19Z

Kconfig.zephyr

@@ -575,6 +575,7 @@ endif # BUILD_OUTPUT_UF2



@marc-hb This change is very useful! Thanks.

Also for the incremental builds test, wouldn't make more sense to touch a header file like ./include/zephyr/kernel.h for example.

tejlmand · 2022-11-04T13:03:21Z

Kconfig.zephyr

@@ -575,6 +575,7 @@ endif # BUILD_OUTPUT_UF2

 config BUILD_OUTPUT_STRIPPED
 	bool "Build a stripped binary"
+	default y


do not close a PR that are rejected and open a new PR with same changes, just hoping to get it approve a second time.

Please keep this change in #51737 and let's continue the discussion there.

I explained why in #51737 why I closed it and I linked the two both ways - not hiding anything.

In #51737 I made the mistake of stopping half-way, not checksumming the stripped binary. As a consequence all the (few) reviewers missed the point and focused on the wrong thing that I wrongly put in the PR title: the stripped file. The stripped file is a pure, low-level and mostly invisible implementation detail that does not matter. I don't care about the stripped file, it's only a means to an end. Finding reproducibility bugs like #48195 and all the others listed in #14593 is the only thing that matters.

tejlmand · 2022-11-04T13:37:15Z

CMakeLists.txt

@@ -1679,6 +1679,7 @@ if(CONFIG_BUILD_OUTPUT_STRIPPED)
            $<TARGET_PROPERTY:bintools,strip_flag_infile>${KERNEL_ELF_NAME}
            $<TARGET_PROPERTY:bintools,strip_flag_outfile>${KERNEL_STRIP_NAME}
            $<TARGET_PROPERTY:bintools,strip_flag_final>
+    COMMAND ${CMAKE_COMMAND} -E sha256sum ${KERNEL_STRIP_NAME}


i'm sure this is a very useful feature, just like printing of memory usage.
But still, printing of memory usage is user configurable, so why should sha256 checksum be always on ?
(I do note it's tied to strip)

zephyr/Kconfig.zephyr

Lines 462 to 463 in 261025a

config OUTPUT_PRINT_MEMORY_USAGE

bool "Print memory usage to stdout"

Should this be a general option that users can enable, so that any output file, bin, hex, s19, etc can have this ?
As a matter of fact, bin files are created per default, and a checksum on those would be more safe than using the elf file for checksumming.

I agree. "Strip" should not be synonymous with "compute & print checksum" and vice versa.

andyross

This seems to be getting maybe needlessly contentious?

From my perspective this is a simple feature, a good thing, and an industry best practice that probably should be on by default for general apps. It seems like the big complaints are just about extra build complexity (one added artifact and a hash step at the end)? Maybe we could default it to =n if CONFIG_ZTEST? That would cover about 80% of the CI overhead. Or alternatively have twister/ci set things up to suppress the step?

Broadly: this is a feature for deployed apps. We have a CI so that we can exercise features for real apps, we don't architect our apps to make things easier for CI. :)

andyross · 2022-11-04T15:20:25Z

i'm sure this is a very useful feature, just like printing of memory usage.

So... just to clarify for folks not familiar: Reproducible/deterministic builds[1] are a security paradigm. The idea is that the source code determines the output product in a provable way, such that people can audit the resulting binaries to detect any modification. That is, it closes the "inside actor" hole where someone with access to build infrastructure can do things like insert backdoors. It's a good thing, even if us hackers at the other end of the source chain will never touch it nor care.

And for traditional software, it's actually really difficult to do. But as it happens, Zephyr does it pretty well already. I don't really see much reason not to have this on for normal apps. Users who don't understand it won't be harmed, and might just be saved if they keep their build logs. The only thing to argue here in my mind is polish: it would be nice if this had a name other than "zephyr.strip". It might be nice to define a more formal form of the output vs. just stuffing it in a build log and sha256sum file. It would be nice to have a section in the docs detailing "what you have to do to secure your reproducible build", etc...

But really, we should merge this. Potentially with a workaround to turn it off in CI if it turns out to be a measurable performance problem.

[1] You sometimes see the term "hermetic builds" or "hermeticity" in the same context, which refers to the practice of ensuring no "outside" software gets run in the context of a build (by analogy to a "hermetic seal" in food packaging).

ceolin · 2022-11-04T16:27:13Z

i'm sure this is a very useful feature, just like printing of memory usage.

So... just to clarify for folks not familiar: Reproducible/deterministic builds[1] are a security paradigm. The idea is that the source code determines the output product in a provable way, such that people can audit the resulting binaries to detect any modification. That is, it closes the "inside actor" hole where someone with access to build infrastructure can do things like insert backdoors. It's a good thing, even if us hackers at the other end of the source chain will never touch it nor care.

And for traditional software, it's actually really difficult to do. But as it happens, Zephyr does it pretty well already. I don't really see much reason not to have this on for normal apps. Users who don't understand it won't be harmed, and might just be saved if they keep their build logs. The only thing to argue here in my mind is polish: it would be nice if this had a name other than "zephyr.strip". It might be nice to define a more formal form of the output vs. just stuffing it in a build log and sha256sum file. It would be nice to have a section in the docs detailing "what you have to do to secure your reproducible build", etc...

But really, we should merge this. Potentially with a workaround to turn it off in CI if it turns out to be a measurable performance problem.

[1] You sometimes see the term "hermetic builds" or "hermeticity" in the same context, which refers to the practice of ensuring no "outside" software gets run in the context of a build (by analogy to a "hermetic seal" in food packaging).

+1 that is exactly what I talked with Marc

stephanosio · 2022-11-05T11:36:22Z

I do not see any problem with enabling this (printing checksum) by default.

The part that is problematic is how it currently works -- i.e. why does enabling CONFIG_BUILD_OUTPUT_STRIPPED automatically enable checksum and disabling BUILD_OUTPUT_STRIPPED automatically disable checksum?

"Stripping" and "computing/printing checksum" are really two orthogonal features and should be controlled by separate configs.

nordicjm · 2022-11-06T15:13:27Z

Kconfig.zephyr

@@ -575,6 +575,7 @@ endif # BUILD_OUTPUT_UF2

 config BUILD_OUTPUT_STRIPPED
 	bool "Build a stripped binary"
+	default y


As per other commit, rejecting as this is not something that should ever be enabled by default

tejlmand · 2022-11-07T12:33:42Z

Reproducible/deterministic builds[1] are a security paradigm. The idea is that the source code determines the output product in a provable way, such that people can audit the resulting binaries to detect any modification.

please note, i'm not against the requested functionality, as I believe that is very useful.

I dislike the current proposal.

We should not just print info into log files just because someone needs this information and knows where to look for it.
Doing so will lead us to a mess on where to find which information.

Also we should not link this to stripped elf files.
You can have stripped files which generates different shas but contain identical binary code.
See also: #51737 (comment)

We already create bin files per default:

zephyr/Kconfig.zephyr

Lines 491 to 493 in b052143

    
           config BUILD_OUTPUT_BIN 
        
           	bool "Build a binary in BIN format" 
        
           	default y

It would be more correct to do a sha on the bin file, compared to doing it on the stripped elf, although I believe a different approach should be taken.

Instead of producing stripped elf files per default and print sha as build information which must be parsed in build log, then we should really try to align how we collect such important information and provide it to users.

In my opinion, a much better approach is to extend the build meta output to not only contain revision information, but extend it to contain information from the produced artifacts.

zephyr/Kconfig.zephyr

Lines 629 to 644 in b052143

    
           config BUILD_OUTPUT_META 
        
           	bool "Create a build meta file" 
        
           	help 
        
           	  Create a build meta file in the build directory containing lists of: 
        
           	  - Zephyr: path and revision (if git repo) 
        
           	  - Zephyr modules: name, path, and revision (if git repo) 
        
           	  - West: 
        
           	    - manifest: path and revision 
        
           	    - projects: path and revision 
        
           	  - Workspace: 
        
           	    - dirty: one or more repositories are marked dirty 
        
           	    - extra: extra Zephyr modules are manually included in the build 
        
           	    - off:   the SHA of one or more west projects are not what the manifest 
        
           	             defined when `west update` was run the last time (`manifest-rev`). 
        
           	             The off state is only present if a west workspace is found. 
        
           	  File extension is .meta

Extending this meta file with checksum of generated files will truly provide users with information on how to reproduce a build.
(together with the .config and CMake cache files).

That would be checksum of artifacts like:

checksum of the elf file
Checksum of following files if produced by the build system
- checksum of stripped elf file
- checksum of bin file
- checksum of hex file
- checksum of s19 file
- etc.

are all important information which can go into the meta file, along the current revisions.

that will give a uniform way of obtaining information for reproducing a build and the ability to verify that the build was indeed identical to the original.

The knowledge of this meta file is even in the docs, although we can definitely be better at exposing this feature.
https://docs.zephyrproject.org/latest/kconfig.html#CONFIG_BUILD_OUTPUT_META

marc-hb · 2022-11-09T00:09:15Z

Thanks everyone for the reviews. I've been drawn into some more urgent task, should get back to this next week at the latest.

The discussion in zephyrproject-rtos/zephyr#51954 may take time and we need reproducibility last year (who doesn't?) Signed-off-by: Marc Herbert <[email protected]>

The discussion in zephyrproject-rtos/zephyr#51954 may take time and we need reproducibility last year (who doesn't?) Signed-off-by: Marc Herbert <[email protected]> (cherry picked from commit 945adb8)

github-actions · 2023-01-08T00:29:43Z

This pull request has been marked as stale because it has been open (more than) 60 days with no activity. Remove the stale label or add a comment saying that you would like to have the label removed otherwise this pull request will automatically be closed in 14 days. Note, that you can always re-open a closed pull request at any time.

Make CONFIG_BUILD_OUTPUT_STRIPPED mandatory so we can always compare builds. It makes practically zero build space and time difference and has huge reproductibility value, see discussion in zephyrproject-rtos/zephyr#51954 Signed-off-by: Marc Herbert <[email protected]>

marc-hb · 2024-12-19T17:46:55Z

We should not just print info into log files just because someone needs this information and knows where to look for it.

In theory, reproducing someone else's environment is easy.
https://martinfowler.com/articles/continuousIntegration.html#PutEverythingInAVersionControlledMainline

In practice it's often a nightmare. Extensive real-world experience shows that build logs is the only thing you can really rely on:

scripts/xtensa-build-zephyr.py: Allow for alternate toolchain versions thesofproject/sof#9736 (comment)

Unless maybe you work in some ideal place where the build is ruled with an iron fist because the CEO is a former automation engineer? :-D

"Stripping" and "computing/printing checksum" are really two orthogonal features and should be controlled by separate configs.

In theory, they're conceptually different. In practice they're the same.

Thanks to stripping (and other things), Linux and Windows builds have been successfully compared in SOF(Zephyr) CI for 2 years now: thesofproject/sof@35dda2a

That automation has successfully found a few build reproducibility issues in Zephyr, see 2023 and 2024 issues at the bottom of this long list: #14593

CONFIG_BUILD_OUTPUT_STRIPPED has been a fantastic success. It just works.

marc-hb added 2 commits November 4, 2022 01:46

This was referenced Nov 4, 2022

Verify builds are reproducible in the CI #50205

Open

scripts: gen_handles: Sort the device handles #48195

Merged

marc-hb marked this pull request as ready for review November 4, 2022 03:36

marc-hb requested review from tejlmand and nashif as code owners November 4, 2022 03:36

zephyrbot added the area: Build System label Nov 4, 2022

zephyrbot requested a review from jeremybettis November 4, 2022 03:37

zephyrbot assigned tejlmand Nov 4, 2022

marc-hb requested review from kv2019i and lgirdwood November 4, 2022 04:08

marc-hb mentioned this pull request Nov 4, 2022

SOF v2.3 thesofproject/sof-bin#106

Merged

marc-hb requested a review from aborisovich November 4, 2022 04:11

aborisovich approved these changes Nov 4, 2022

View reviewed changes

dbaluta self-requested a review November 4, 2022 09:56

dbaluta approved these changes Nov 4, 2022

View reviewed changes

dbaluta reviewed Nov 4, 2022

View reviewed changes

tejlmand requested changes Nov 4, 2022

View reviewed changes

tejlmand reviewed Nov 4, 2022

View reviewed changes

andyross approved these changes Nov 4, 2022

View reviewed changes

ceolin approved these changes Nov 4, 2022

View reviewed changes

marc-hb mentioned this pull request Nov 4, 2022

xtensa-build-zephyr.py: install and checksum .elf, .lst and other files thesofproject/sof#6501

Merged

nordicjm requested changes Nov 6, 2022

View reviewed changes

marc-hb mentioned this pull request Jan 7, 2023

west sign: imgtool: zephyr.signed.hex and zephyr.signed.bin do not have the same contents #52271

Closed

github-actions bot added the Stale label Jan 8, 2023

github-actions bot closed this Jan 22, 2023

johnandersen777 mentioned this pull request Feb 28, 2023

service: sw: src: change: notify: Service to facilitate poly repo pull model dev tooling intel/dffml#1315

Open

2 tasks

marc-hb mentioned this pull request Mar 20, 2024

list_hardware.py: sort rglob(SOC_YML) HWMv2 results #70132

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

cmake: compute and display the reproducible checksum by default #51954

cmake: compute and display the reproducible checksum by default #51954

marc-hb commented Nov 4, 2022 •

edited

Loading

johnandersen777 commented Nov 4, 2022 •

edited

Loading

dbaluta Nov 4, 2022

tejlmand Nov 4, 2022

marc-hb Nov 4, 2022 •

edited

Loading

tejlmand Nov 4, 2022

stephanosio Nov 4, 2022

andyross left a comment

andyross commented Nov 4, 2022

ceolin commented Nov 4, 2022

stephanosio commented Nov 5, 2022

nordicjm Nov 6, 2022

tejlmand commented Nov 7, 2022 •

edited

Loading

marc-hb commented Nov 9, 2022 •

edited

Loading

github-actions bot commented Jan 8, 2023

marc-hb commented Dec 19, 2024

	config OUTPUT_PRINT_MEMORY_USAGE
	bool "Print memory usage to stdout"

cmake: compute and display the reproducible checksum by default #51954

cmake: compute and display the reproducible checksum by default #51954

Conversation

marc-hb commented Nov 4, 2022 • edited Loading

johnandersen777 commented Nov 4, 2022 • edited Loading

dbaluta Nov 4, 2022

Choose a reason for hiding this comment

tejlmand Nov 4, 2022

Choose a reason for hiding this comment

marc-hb Nov 4, 2022 • edited Loading

Choose a reason for hiding this comment

tejlmand Nov 4, 2022

Choose a reason for hiding this comment

stephanosio Nov 4, 2022

Choose a reason for hiding this comment

andyross left a comment

Choose a reason for hiding this comment

andyross commented Nov 4, 2022

ceolin commented Nov 4, 2022

stephanosio commented Nov 5, 2022

nordicjm Nov 6, 2022

Choose a reason for hiding this comment

tejlmand commented Nov 7, 2022 • edited Loading

marc-hb commented Nov 9, 2022 • edited Loading

github-actions bot commented Jan 8, 2023

marc-hb commented Dec 19, 2024

marc-hb commented Nov 4, 2022 •

edited

Loading

johnandersen777 commented Nov 4, 2022 •

edited

Loading

marc-hb Nov 4, 2022 •

edited

Loading

tejlmand commented Nov 7, 2022 •

edited

Loading

marc-hb commented Nov 9, 2022 •

edited

Loading