Minor changes to improve compilation speed #137

Iterator combinators have a noticable compilation time overhead because they must be monomorphized and end up passing more code to LLVM. This code does get optimized out, but that takes time and slows down the overall build.

I'm not sure how much of a compile time impact this makes, but this prevents the compiler from having to generate formatting code when the debug macro is not in use.

The current implementation uses a HashMap to deduplicate the output from each core of the same cpu. This commit instead collects the output for each core in to a Vec, and then sorts it to deduplicate physical CPUs. This reduces the code size processed by LLVM by 15-20%, as counted by `cargo llvm-lines --lib -p num_cpus` on both debug and release. These implementations have different performance characteristics: - the HashMap must hash each key, and SipHash is slow on small keys - the number of cores will be small (<1024) so sorting the list should be very fast - the list will likely already be sorted I have not benchmarked this code, but it should be around the same speed or slightly faster (from testing against randomized lists).

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Minor changes to improve compilation speed #137

Minor changes to improve compilation speed #137

Commits on Feb 25, 2024

Minor changes to improve compilation speed #137

Are you sure you want to change the base?

Minor changes to improve compilation speed #137

Commits on Feb 25, 2024