Consider using stackalloc for string.Split #6266

jamesqo · 2016-07-06T08:38:51Z

Right now, string.Split allocates a new int array for both the char and string versions. We might want to consider using stackalloc here instead for strings of lengths under a certain threshold.

Alternatively, we could introduce some type of ArrayPool-like API to mscorlib and use that, or just forego allocating anything altogether and simply count the length needed for the resulting array on our first pass.

The text was updated successfully, but these errors were encountered:

gkhanna79 · 2017-03-06T05:03:39Z

CC @danmosemsft @AlexGhiondea

AlexGhiondea · 2017-03-06T05:06:31Z

Sounds like a great idea for an enhancement. Anyone interested in picking this up?

4real · 2017-03-24T17:17:39Z

I'm looking into it, seems like a nice introduction to coreclr.

danmoseley · 2017-03-24T18:17:06Z

@karelz can you please give @4real this issue

@4real whatever you do will need a fair bit of performance measurement to find and prove the best approach on various inputs. We usually use Benchmark.NET.

karelz · 2017-03-24T18:31:52Z

@4real I sent you the invite, please ping us when you accept.

@danmosemsft I thought you have same admin permissions as I do ... we should fix that ;-)

4real · 2017-03-25T00:31:06Z

@karelz Accepted invite!

@danmosemsft Thanks, I'll have a look at Benchmark.NET. I take it that sufficient proof is empirical evidence a new solution is significantly faster and there being a line of reasoning as to why that is? Should I write a performance test for this issue specifically? (Apologies if this is covered by the documentation, I haven't processed it all yet.)

danmoseley · 2017-03-25T00:38:49Z

@4real there aren't really hard rules but generally we're looking for

parity or faster across all interesting ranges of inputs
improvement is worth any increase in code complexity and maintenance cost. This depends on how heavily used/critical the API is. We have taken and will continue to take quite hairy changes eg in Encoding or StringBuilder because those show up in key performance scenarios. In something that's less performance critical we are more wary of accepting code complexity. Of course lower in the stack, the worse it is if we have a bad impact on some scenario. Sometimes changes take a long time to get right and often they don't work out and have to be aborted.
no affect on functional behavior and acceptably low risk of breaking something

Of course there are plenty of perf changes where it's a no brainer and sometimes we don't even bother measuring (eg just removing redundant code).

danmoseley · 2017-03-25T00:40:50Z

Incidentally for profiling you can use the Visual Studio 2017 profiler but many of us use PerfView as it's owned by our team. It's very powerful and has good tutorials.

4real · 2017-03-25T22:30:55Z

Thanks, I'll have a look at PerfView too.

From what I can tell, there are XUnit.Performance tests in the codebase for the GC and JIT, but not mscorlib. So instead, I will create a Benchmark.NET performance test for string.Split which will not be included in the commit, but for which I will post the results of here - does this seem like the correct approach? I will however create a functional test to show correctness of changes to string.Split if deemed not covered by existing tests.

danmoseley · 2017-03-27T16:24:58Z

@4real correct, that's what we normally do -- have a throwaway perf test -- of course you could archive it in a gist with a link in the PR in case we want it again.

There are lots of XUnit.Performance tests in the repo and they are run regularly but those are to catch regressions and there is a certain cost to maintaining them (watching results). So we could consider adding one if it was an important scenario that might regress. Normally we don't. In fact we may delete some of those tests -- they need an audit.

danmoseley · 2017-11-23T04:48:51Z

@4real still planning to take a look? just realized I never checked back. it would be nice to optmize this.

lkts · 2017-11-30T06:56:33Z

I would like to give it a try if nobody is looking

4real · 2017-11-30T07:15:58Z

Hello, feel free to take it over, I've just realized how much time this has been resting. My attention has been shifted elsewhere unfortunately and I wouldn't start working on this within reasonable time.

karelz · 2017-11-30T16:45:58Z

@cod7alex I sent you Collaborator invite - that will allow to assign the issue to you (GH limitation). Ping me when you accept.
ProTip: Accepting will automatically subscribe you to all notifications. There's plenty (500+ per day). We recommend to disable them and use just notifications for your mentions and explicit subscriptions to issues.

danmoseley · 2017-11-30T17:39:29Z

@cod7alex to restate the above, we typically use PerfView to profile, Benchmark.NET to prove the win (post results here).

To run corefx tests (where most tests are) against changes made in coreclr (where string is) look at https://github.com/dotnet/corefx/blob/master/Documentation/project-docs/developer-guide.md#testing-with-private-coreclr-bits to see the flag to force this to happen.

If the benchmark results and change is satisfactory we will want to check in benchmark tests.Currently we still use xunit.perf for checked-in tests (that may change) - see existing tests in corefx\src\System.Runtime\tests\Performance\Perf.String.cs. This may be helpful
https://github.com/dotnet/corefx/blob/master/Documentation/project-docs/performance-tests.md

I think we need better docs on using Benchmark.NET against private coreclr changes, based on https://github.com/dotnet/corefx/blob/master/Documentation/project-docs/dogfooding.md. Hopefully @ViktorHofer can throw up some notes he has. We can also help answer questions.

lkts · 2017-11-30T18:25:41Z

thank you @danmosemsft

ViktorHofer · 2017-11-30T20:02:02Z

BenchmarkDotNet instructions not yet reviewed by someone else available here: dotnet/corefx#25612

lkts · 2017-12-02T09:54:22Z

@ViktorHofer this guide is very helpful, i was able to run benchmark.
First thing i have done was to use stackalloc for int array when string length is less than 512.
I must have done something wrong, will attach benchmarks later.

lkts · 2017-12-06T19:25:09Z

Can someone help me with Spans? If i have the code like the following, can i pass Span to function and fill it and what is the way to do it if yes? Is it even a good idea?

private unsafe Span<int> CreateSeparatorList(int length)
{
    if (Length < 512)
    {
        int* stackBuffer = stackalloc int[length];
        return new Span<int>(stackBuffer, length);
    }
    return new int[length];
}

Rattenkrieg · 2017-12-06T20:28:13Z

@cod7alex this is explicitly forbidden by design of Span. Memory it points will be reclaimed when you leave CreateSeparatorList's activation record. Spans (actually anything stacalloced) can only be passed from callers to callees.

lkts · 2017-12-06T20:46:29Z

thanks @Rattenkrieg, I am not really familiar with stackalloc. So it means I should inline this everywhere it is needed and return the filled span from methods?

Rattenkrieg · 2017-12-06T20:50:12Z

@cod7alex if you know desired size in advance you can allocate span in top level method and pass it by ref where it will be populated.
if you will ever hit unexplainable performance degradation caused by stackalloc this may shed some light: #6122 dotnet/coreclr#8534 #4384

lkts · 2017-12-07T07:16:43Z

@Rattenkrieg many thanks

jamesqo · 2017-12-07T13:52:31Z

@cod7alex By the way, you don't have to stackalloc a temporary ptr and then create a span from that. You can just stackalloc a Span directly. https://github.com/dotnet/corefx/pull/24212/files

lkts · 2017-12-07T20:22:56Z

@jamesqo yes, i have found that information recently. Thank you.

karelz assigned danmoseley Mar 24, 2017

danmoseley assigned 4real and unassigned danmoseley Mar 25, 2017

danmoseley assigned lkts and unassigned 4real Nov 30, 2017

jkotas closed this as completed in dotnet/coreclr#15435 Feb 5, 2018

msftgits transferred this issue from dotnet/coreclr Jan 31, 2020

msftgits added this to the Future milestone Jan 31, 2020

CarolEidt mentioned this issue Oct 27, 2020

[RyuJIT] Eliminate unecessary copies when passing structs #9839

Closed

ghost locked as resolved and limited conversation to collaborators Dec 30, 2020

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Consider using stackalloc for string.Split #6266

Consider using stackalloc for string.Split #6266

jamesqo commented Jul 6, 2016

gkhanna79 commented Mar 6, 2017

AlexGhiondea commented Mar 6, 2017

4real commented Mar 24, 2017

danmoseley commented Mar 24, 2017

karelz commented Mar 24, 2017 •

edited

Loading

4real commented Mar 25, 2017 •

edited

Loading

danmoseley commented Mar 25, 2017 •

edited

Loading

danmoseley commented Mar 25, 2017

4real commented Mar 25, 2017

danmoseley commented Mar 27, 2017

danmoseley commented Nov 23, 2017

lkts commented Nov 30, 2017

4real commented Nov 30, 2017

karelz commented Nov 30, 2017

danmoseley commented Nov 30, 2017

lkts commented Nov 30, 2017

ViktorHofer commented Nov 30, 2017 •

edited

Loading

lkts commented Dec 2, 2017 •

edited

Loading

lkts commented Dec 6, 2017 •

edited

Loading

Rattenkrieg commented Dec 6, 2017

lkts commented Dec 6, 2017 •

edited

Loading

Rattenkrieg commented Dec 6, 2017

lkts commented Dec 7, 2017

jamesqo commented Dec 7, 2017

lkts commented Dec 7, 2017

Consider using stackalloc for string.Split #6266

Consider using stackalloc for string.Split #6266

Comments

jamesqo commented Jul 6, 2016

gkhanna79 commented Mar 6, 2017

AlexGhiondea commented Mar 6, 2017

4real commented Mar 24, 2017

danmoseley commented Mar 24, 2017

karelz commented Mar 24, 2017 • edited Loading

4real commented Mar 25, 2017 • edited Loading

danmoseley commented Mar 25, 2017 • edited Loading

danmoseley commented Mar 25, 2017

4real commented Mar 25, 2017

danmoseley commented Mar 27, 2017

danmoseley commented Nov 23, 2017

lkts commented Nov 30, 2017

4real commented Nov 30, 2017

karelz commented Nov 30, 2017

danmoseley commented Nov 30, 2017

lkts commented Nov 30, 2017

ViktorHofer commented Nov 30, 2017 • edited Loading

lkts commented Dec 2, 2017 • edited Loading

lkts commented Dec 6, 2017 • edited Loading

Rattenkrieg commented Dec 6, 2017

lkts commented Dec 6, 2017 • edited Loading

Rattenkrieg commented Dec 6, 2017

lkts commented Dec 7, 2017

jamesqo commented Dec 7, 2017

lkts commented Dec 7, 2017

karelz commented Mar 24, 2017 •

edited

Loading

4real commented Mar 25, 2017 •

edited

Loading

danmoseley commented Mar 25, 2017 •

edited

Loading

ViktorHofer commented Nov 30, 2017 •

edited

Loading

lkts commented Dec 2, 2017 •

edited

Loading

lkts commented Dec 6, 2017 •

edited

Loading

lkts commented Dec 6, 2017 •

edited

Loading