
build: add load average limit to reduce CPU overcommitment #4094

Merged: 13 commits merged into LibreELEC:master on Jan 20, 2020

Conversation

@dhewg (Contributor) commented Jan 5, 2020

For make- and ninja-based build systems, no new jobs are started if the load
average is greater than number_of_cores * 1.5.
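
(For reference, this is the -l/--load-average mechanism both tools already provide; the numbers below are only an illustration, not the exact invocation used by scripts/build:)

make -j8 -l 12.00    # start no new job while the 1-minute load average is above 12.00
ninja -j8 -l 12.00   # ninja's -l option behaves the same way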

Building an image on an 8-core CPU, so using a load limit of 10.00 (this was with an earlier version of the patch, which used * 1.25):

[Graphs: before vs. after, load (without-load / with-load) and available memory (without-avail / with-avail)]

(that's with building on tmpfs and autoremove)

@MilhouseVH (Contributor):

[ -z "${CONCURRENCY_MAKE_LEVEL}" ] && export CONCURRENCY_MAKE_LEVEL=$(nproc)
[ -z "${CONCURRENCY_LOAD}" ] && export CONCURRENCY_LOAD=$(python3 -c "import os; print('%.2f' % (os.cpu_count() * 1.25))")

Is (or maybe, should?) CONCURRENCY_LOAD be linked to CONCURRENCY_MAKE_LEVEL?

For instance, if the user forces CONCURRENCY_MAKE_LEVEL to be 4 (on an 8-core CPU), should the default CONCURRENCY_LOAD then become 4 * 1.25? Otherwise they'll still be over-committed by default (at 10).

In which case we could use this:

[ -z "${CONCURRENCY_MAKE_LEVEL}" ] && export CONCURRENCY_MAKE_LEVEL=$(nproc)
[ -z "${CONCURRENCY_LOAD}" ] && export CONCURRENCY_LOAD=$(echo "scale=2; ${CONCURRENCY_MAKE_LEVEL} * 1.25" | bc)

Also, I'd like to allow a way to completely disable CONCURRENCY_LOAD, for example CONCURRENCY_LOAD=0 could mean we don't apply -l at all.
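
A minimal sketch of what that opt-out could look like on the consuming side; LOAD_OPTS and its placement are illustrative assumptions, not necessarily what the PR implements:

LOAD_OPTS=""
if [ "${CONCURRENCY_LOAD}" != "0" ]; then
  LOAD_OPTS="-l ${CONCURRENCY_LOAD}"
fi
# LOAD_OPTS is deliberately left unquoted so "-l <value>" expands to two arguments
make -j ${CONCURRENCY_MAKE_LEVEL} ${LOAD_OPTS}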

@MilhouseVH (Contributor) commented Jan 5, 2020

With this PR I think we can now lose this line, as it's redundant:

NINJA_OPTS=""

@MilhouseVH (Contributor):

Could we also replace NINJA_OPTS with NINJA_FLAGS (exporting the latter, then removing it from scripts/build and from the mariadb and llvm packages), as this is functionally equivalent to MAKEFLAGS: ninja-build/ninja#1399

Alternatively I'd be happy for this to be addressed in a follow-up PR.
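
For context, this is the make-side mechanism NINJA_FLAGS would have mirrored: make picks up extra options from the MAKEFLAGS environment variable, so nothing has to be passed per package (values below are illustrative):

export MAKEFLAGS="-j8 -l 12.00"
make    # inherits -j and -l from MAKEFLAGS, no extra arguments needed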

@dhewg (Contributor, Author) commented Jan 6, 2020

  • dropped the NINJA_OPTS line from config/optimize
  • allow CONCURRENCY_LOAD=0 to disable load limiting
  • use number_of_cores * 1.5 by default, as build times regressed on some setups (see the sketch below)
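
Roughly, the updated defaults then become something like this (whether the merged patch computes the value with bc, python3 or something else isn't shown here; the point is the 1.5 factor and the 0 opt-out):

[ -z "${CONCURRENCY_MAKE_LEVEL}" ] && export CONCURRENCY_MAKE_LEVEL=$(nproc)
[ -z "${CONCURRENCY_LOAD}" ] && export CONCURRENCY_LOAD=$(echo "scale=2; ${CONCURRENCY_MAKE_LEVEL} * 1.5" | bc)
# a user-supplied CONCURRENCY_LOAD=0 means: don't pass -l at all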

@dhewg (Contributor, Author) commented Jan 6, 2020

I didn't use NINJA_FLAGS since that isn't upstream and it looks like it won't get merged

@HiassofT (Member) commented Jan 6, 2020

I did a test on my laptop (i7-3740QM, 16GB RAM) and RPi4 build time with this PR was about the same as for plain master (01:26:54.453 vs 01:27:43.468 for master).

I recorded available memory (MemAvailable from /proc/meminfo) and load (1-minute average, the first value from /proc/loadavg), and it looks like available memory is nothing we need to worry about much - at least with the 8 logical CPU cores I tested with (it could be different for 32/64-core systems). Max memory usage was about 4GB in both cases; with the PR, the memory usage peak near the end of the build wasn't observable, though.
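
(A sampling loop along these lines captures both values once per second; it is only an illustration, not necessarily the exact commands used:)

while sleep 1; do
  printf '%s %s %s\n' "$(date +%s)" \
    "$(awk '/^MemAvailable/ {print $2}' /proc/meminfo)" \
    "$(cut -d' ' -f1 /proc/loadavg)"
done >> build-stats.log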
[Graphs: memory and load over the course of the build, master vs. pr4094]

@MilhouseVH (Contributor):

> I didn't use NINJA_FLAGS since that isn't upstream and it looks like it won't get merged

Yeah, thanks for being on the ball - I looked at the ninja PR and thought to myself "this seems like a no-brainer" and that it had been merged. Little did I know the drama that was to follow, including an actual fork. Crazy. :(

@dhewg force-pushed the pull/load branch 5 times, most recently from 493c3de to 4832383, on January 11, 2020 14:52
@dhewg (Contributor, Author) commented Jan 14, 2020

Updated the screen, mame and mame2016 patches.
Because we need to patch that horrific genie, we need to fix its compilation, and because of that its cross compilation too.

@dhewg force-pushed the pull/load branch 3 times, most recently from d40ee77 to 0c4d093, on January 16, 2020 07:44
dhewg added 13 commits January 17, 2020 09:01

Don't set PTR64=0 on 32bit archs, instead set ARCHITECTURE to an empty
string. That way we don't need to patch out hardcoded -m32 arguments.

While at it, disable the bgfx hw renderers, or that pile of build crap
tries to include X11 headers.

Yay, more missing dependencies:

In file included from ../../../../../src/mame/audio/taito_zm.h:14,
                 from ../../../../../src/mame/drivers/zn.cpp:15:
../../../../../src/devices/cpu/tms57002/tms57002.h:208:10: fatal error: ../../emu/cpu/tms57002/tms57002.hxx: No such file or directory

Replace the current patch with the same approach as libretro-mame:

Don't set PTR64=0 on 32bit archs, instead set ARCHITECTURE to an empty
string. That way we don't need to patch out hardcoded -m32 arguments.

While at it, disable the bgfx hw renderers, or that pile of build crap
tries to include X11 headers.

The make dependencies are a mess, take no chances.
Occasionally it attempts to link the plugin against the library before the
library is linked.

For make and ninja based build systems, no new jobs are started if the load
average is greater than number_of_cores * 1.5.
@HiassofT merged commit 2cb65bb into LibreELEC:master on Jan 20, 2020
@dhewg deleted the pull/load branch on January 21, 2020 06:31
@eli-schwartz:

> Yeah, thanks for being on the ball - I looked at the ninja PR and thought to myself "this seems like a no-brainer" and that it had been merged. Little did I know the drama that was to follow, including an actual fork. Crazy. :(

The independent Ninja build language reimplementation in C, at https://github.com/michaelforney/samurai, supports this via the SAMUFLAGS environment variable.
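
For example (assuming samu accepts the same -j/-l options as ninja, which is what SAMUFLAGS is meant to carry):

export SAMUFLAGS="-j8 -l 12"
samu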

@MilhouseVH (Contributor):

@eli-schwartz

s 😄
