Unexpected allocations reported in CPU-bound code #1542

ronbrogan · 2020-09-30T05:31:23Z

I'm trying to validate that code doesn't allocate anything and I have some unit tests asserting on the summary provided from running benchmarks.

However, I'm seeing a non-deterministic amount of these test runs fail due to allocations sometimes being reported for a given benchmark - and sometimes not.

My method under test in the real world application takes about 20ms per op (CPU bound, no allocations), so my repro here has a dummy loop to simulate the work.

These benchmarks are all invoking the same method, there are multiple just to illustrate that the same code can yield different allocation results.

Source for the repro is here:
https://gist.github.com/ronbrogan/bd53bddd76cfb878eef0ae0a683434df

My only line of reasoning right now is that this is due to the minimum allocation size leaking over into the measured allocations, but I still don't understand why this couldn't be avoided.

In the repro I use GC.GetAllocatedBytesForCurrentThread before/after running my method and there is no difference. Is this an issue with MemoryDiagnoser, user error, or is it simply not reasonable to try to assert that a given benchmark makes 0 allocations?

The text was updated successfully, but these errors were encountered:

adamsitnik · 2020-10-01T09:49:15Z

Hi @ronbrogan

Big thanks for a very detailed bug report with a simple repro case!

I was able to reproduce it for .NET Core 3.1. 2.1 and 5.0 are free of this bug, I will dig deeper and get back to you

adamsitnik · 2020-10-01T12:12:29Z

Ok, this is most probably a side-effect of Tiered JIT which allocates something on the other Thread.

To test it I set the following env var: COMPlus_TieredCompilation:0

adamsitnik · 2020-10-01T14:21:26Z

I've confirmed that it's Tiered JIT background thread:

…iteration, the Tiered JIT might kick-in and allocate some memory and affect the results as a workaround, we can put the thread to sleep for more than 200ms to TC thread kicks in before we start memory measurements it's far from perfect but it works fixes #1542

Turnerj · 2020-10-28T07:45:21Z

I noticed something like this randomly happening in my benchmarks too for a little while, thought it was something weird with my code. Transitioned some allocate-y code to use ArrayPool and it was sometimes allocating a small number of bytes and sometimes not at all - my code is otherwise CPU bound like the OP.

Just quickly jumping through the thread in your PR @adamsitnik , is one potential "quick fix" solution to simply benchmark with the latest .NET 5? (Currently using 3.1 but in my case, can switch to just .NET 5 RC2 easy enough) Edit: Misread your earlier comment, thought you wrote that 3.1, 2.1 and 5.0 all had the bug.

For my own curiousity - why would there be an allocation by the JIT during the diagnoser run? I would have thought the workload and overhead JIT runs would have done everything including any allocations that they may have needed. Is it just that the tiered JIT process can happen outside of the dedicated time that BDN sets for jitting? (I have no knowledge how all that logic is done under-the-hood in BDN so I'm probably missing something obvious)

Turnerj · 2020-10-28T14:14:28Z

Just got around to running my benchmark on .NET 5, it does seem to be allocating still for me.

Code base: https://github.com/Turnerj/LevenshteinBenchmarks/tree/2475940db8c4c6f7727c20d5a3ba20a200e77e5c
The specific implementation that shouldn't allocate: https://github.com/Turnerj/LevenshteinBenchmarks/blob/2475940db8c4c6f7727c20d5a3ba20a200e77e5c/Implementations/03_ArrayPool.cs

Just run the "ArrayPool" benchmark to see the results. My use of ArrayPool is well below the 1,048,576 item limit so I don't understand where the allocations are coming from besides something wrong in the diagnoser or the runtime itself.

timcassell · 2020-12-02T07:57:50Z

See issue dotnet/runtime#45446

Even though I'm measuring differently there (total bytes instead of allocations), I think the problem is the same. It's affected in both .NET Core 3.1 and .NET 5.0 in my tests (5.0 is worse).

…olchain as it suffers from #1542 (Tiered JIT allocating memory in background)

* use httpS * update project files to net5.0 * don't use CLASSIC and CORE #if defines * enable ThreadingDiagnoserTests tests that were disabled so far (APIs not available prior to .NET Core 3) * update samples * update remaining tests * get warnings to 0 * update build scripts * fix MultipleRuntimesTest (.NET Core is not called Core anymore ;) ) * disable ThreadingDiagnoserTests for the InProcessToolchain * disable the failing CoreRT tests * disable some MemoryDiagnoser tests due to #1542

adamsitnik added the Area:Diagnosers label Oct 1, 2020

adamsitnik self-assigned this Oct 1, 2020

adamsitnik mentioned this issue Oct 1, 2020

Memory diagnoser fix for Tiered Compilation #1543

Closed

adamsitnik mentioned this issue Nov 6, 2020

Measure loop alignment's performance impact on Microbenchmarks dotnet/runtime#44051

Closed

adamsitnik mentioned this issue Nov 19, 2020

fix issue #1561 #1600

Merged

petarpetrovt mentioned this issue Nov 26, 2020

Benchmarks give inconsistent memory results petarpetrovt/sorting-networks#26

Open

adamsitnik added a commit that referenced this issue Jan 12, 2021

disable AllocationQuantumIsNotAnIssueForNetCore21Plus for InProcessTo…

0c6c076

…olchain as it suffers from #1542 (Tiered JIT allocating memory in background)

adamsitnik added a commit that referenced this issue Jan 12, 2021

disable two MemoryDiagnoser test due to #1542

e5d30e4

adamsitnik mentioned this issue Mar 31, 2021

when all builds fail, BDN should stop after printing first error #1672

Merged

adamsitnik mentioned this issue Jun 8, 2023

[MemoryDiagnoser] Shows strange values for 'almost' empty benchmarks #2321

Closed

This was referenced Aug 15, 2023

Why does an empty method report allocating bytes? #2402

Closed

[MemoryDiagnoser] inaccurate and influenced by [GlobalSetup] work #1599

Closed

timcassell linked a pull request Apr 15, 2024 that will close this issue

Improve memory diagnoser accuracy #2562

Open

timcassell assigned timcassell and unassigned adamsitnik Apr 25, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Unexpected allocations reported in CPU-bound code #1542

Unexpected allocations reported in CPU-bound code #1542

ronbrogan commented Sep 30, 2020

adamsitnik commented Oct 1, 2020

adamsitnik commented Oct 1, 2020

adamsitnik commented Oct 1, 2020

Turnerj commented Oct 28, 2020 •

edited

Loading

Turnerj commented Oct 28, 2020

timcassell commented Dec 2, 2020

Unexpected allocations reported in CPU-bound code #1542

Unexpected allocations reported in CPU-bound code #1542

Comments

ronbrogan commented Sep 30, 2020

adamsitnik commented Oct 1, 2020

adamsitnik commented Oct 1, 2020

adamsitnik commented Oct 1, 2020

Turnerj commented Oct 28, 2020 • edited Loading

Turnerj commented Oct 28, 2020

timcassell commented Dec 2, 2020

Turnerj commented Oct 28, 2020 •

edited

Loading