Type test `List<T>` in Enumerable.SequenceEquals and forward to `MemoryExtensions.SequenceEqual(span...)` #97004

neon-sunset · 2024-01-15T19:59:19Z

Type test List<T> in Enumerable.SequenceEqual and forward to MemoryExtensions.SequenceEqual(span...), move the loops to local functions to allow the method to get inlined and type tests get optimized away.

Seems like a simple performance win that was accidentally missed when CollectionsMarshal.AsSpan was added.

Method	Length	Mean	Error	StdDev	Ratio	RatioSD
CompareLists	0	8.911 ns	0.0204 ns	0.0181 ns	1.00	0.00
CompareListsNew	0	9.860 ns	0.0375 ns	0.0351 ns	1.11	0.00
CompareSequences	0	13.220 ns	0.2870 ns	0.5862 ns	1.46	0.05
CompareSequencesNew	0	14.935 ns	0.0784 ns	0.0733 ns	1.68	0.01

CompareLists	10	17.175 ns	0.0675 ns	0.0632 ns	1.00	0.00
CompareListsNew	10	10.120 ns	0.0416 ns	0.0389 ns	0.59	0.00
CompareSequences	10	22.020 ns	0.4596 ns	0.7679 ns	1.27	0.05
CompareSequencesNew	10	23.233 ns	0.0630 ns	0.0589 ns	1.35	0.01

CompareLists	100	97.854 ns	0.2919 ns	0.2588 ns	1.00	0.00
CompareListsNew	100	13.812 ns	0.0460 ns	0.0430 ns	0.14	0.00
CompareSequences	100	104.157 ns	1.2829 ns	1.2000 ns	1.07	0.01
CompareSequencesNew	100	104.935 ns	0.2738 ns	0.2561 ns	1.07	0.00

CompareLists	1000	1,068.779 ns	4.2436 ns	3.7618 ns	1.00	0.00
CompareListsNew	1000	61.721 ns	0.1062 ns	0.0993 ns	0.06	0.00
CompareSequences	1000	860.058 ns	3.0423 ns	2.8458 ns	0.80	0.00
CompareSequencesNew	1000	869.841 ns	2.3611 ns	2.0930 ns	0.81	0.00

(list here is List<int>, sequence here is Enumerable.Range)

Related: #97000 once/if this is fixed, list1.SequenceEqual(list2) will be able to get optimized down to direct call on their spans.

ghost · 2024-01-15T19:59:31Z

Tagging subscribers to this area: @dotnet/area-system-linq
See info in area-owners.md if you want to be subscribed.

Issue Details

Type test List<T> in Enumerable.SequenceEqual and forward to MemoryExtensions.SequenceEqual(span...), move the loops to local functions to allow the method to get inlined and type tests get optimized away.

Seems like a simple performance win that was accidentally missed when CollectionsMarshal.AsSpan was added.

Method	Length	Mean	Error	StdDev	Ratio	RatioSD
CompareLists	0	8.911 ns	0.0204 ns	0.0181 ns	1.00	0.00
CompareListsNew	0	9.860 ns	0.0375 ns	0.0351 ns	1.11	0.00
CompareSequences	0	13.220 ns	0.2870 ns	0.5862 ns	1.46	0.05
CompareSequencesNew	0	14.935 ns	0.0784 ns	0.0733 ns	1.68	0.01

CompareLists	10	17.175 ns	0.0675 ns	0.0632 ns	1.00	0.00
CompareListsNew	10	10.120 ns	0.0416 ns	0.0389 ns	0.59	0.00
CompareSequences	10	22.020 ns	0.4596 ns	0.7679 ns	1.27	0.05
CompareSequencesNew	10	23.233 ns	0.0630 ns	0.0589 ns	1.35	0.01

CompareLists	100	97.854 ns	0.2919 ns	0.2588 ns	1.00	0.00
CompareListsNew	100	13.812 ns	0.0460 ns	0.0430 ns	0.14	0.00
CompareSequences	100	104.157 ns	1.2829 ns	1.2000 ns	1.07	0.01
CompareSequencesNew	100	104.935 ns	0.2738 ns	0.2561 ns	1.07	0.00

CompareLists	1000	1,068.779 ns	4.2436 ns	3.7618 ns	1.00	0.00
CompareListsNew	1000	61.721 ns	0.1062 ns	0.0993 ns	0.06	0.00
CompareSequences	1000	860.058 ns	3.0423 ns	2.8458 ns	0.80	0.00
CompareSequencesNew	1000	869.841 ns	2.3611 ns	2.0930 ns	0.81	0.00

(sequence here is Enumerable.Range)

Related: #97000 once/if this is fixed, list1.SequenceEqual(list2) will be able to get optimized down to direct call on their spans.

Author:	neon-sunset
Assignees:	-
Labels:	`area-System.Linq`, `community-contribution`
Milestone:	-

src/libraries/System.Linq/src/System/Linq/SequenceEqual.cs

stephentoub · 2024-01-15T22:10:21Z

Seems like a simple performance win that was accidentally missed when CollectionsMarshal.AsSpan was added.

It's a win when the test succeeds. It's pure overhead when it doesn't. If your scenario involves many calls to short non-lists, it's a net negative, which is why it wasn't previously done. We've been slowly measuring and opting in more cases using TryGetSpan.

EgorBo · 2024-01-15T22:48:21Z

Seems like a simple performance win that was accidentally missed when CollectionsMarshal.AsSpan was added.

It's a win when the test succeeds. It's pure overhead when it doesn't. If your scenario involves many calls to short non-lists, it's a net negative, which is why it wasn't previously done. We've been slowly measuring and opting in more cases using TryGetSpan.

My 5 cents: there were multiple improvements for casts over last few years so maybe we can reconsider some of the decisions against adding more fast-paths in LINQ (including ReadOnlyCollection) E.g. recent "profiled casts" improved quite a few LINQ benchmarks.

stephentoub · 2024-01-16T00:41:20Z

there were multiple improvements for casts over last few years so maybe we can reconsider some of the decisions against adding more fast-paths in LINQ

That's why we've been more lenient over the last few releases.

recent "profiled casts" improved quite a few LINQ benchmarks.

I'm still a little skeptical of this one. It's great for microbenchmarks in LINQ, but these APIs ends up being used from many different call sites with many different inputs. I suspect the wins that show up in microbenchmarks won't have nearly as positive impact on many uses. I'm glad it's there, and it'll certainly help some cases, but we can't use it as a crutch, and could easily be mislead if we're not careful, I think.

EgorBo · 2024-01-16T00:58:51Z

I'm still a little skeptical of this one. It's great for microbenchmarks in LINQ, but these APIs ends up being used from many different call sites with many different inputs. I suspect the wins that show up in microbenchmarks won't have nearly as positive impact on many uses.

You can say the same about other things optimized with PGO in general. Polymorphic casts (where different callers use different data) are not expected to be optimized with PGO as we try to optimize only monomorphic cases. Sure, it doesn't work well if an app starts to behave differently after certain point, but we don't have a better option besides recommending turning PGO off completely for such apps. Eventually, we'll improve this by enabling context-sensitive instrumentation, partial inlining, instrumentation for inlinees, etc.

stephentoub · 2024-01-16T01:02:24Z

You can say the same about other things optimized with PGO in general.

I do :-) It's just this particular optimization applies particularly well to LINQ microbenchmarks, as you've called out.

eiriktsarpalis

I wouldn't object to this being included, however I don't think it's particularly impactful given that it only additionally targets List<T>. Are there any other collections we could extract the span from? I can only think of ImmutableArray<T>.

…ce comparison for arrays and lists

neon-sunset · 2024-01-16T17:54:33Z

Since #97005 deemed to be unprofitable, I have reverted the rest of the change and only kept seq is T[] -> seq.TryGetSpan.

Numbers

BenchmarkDotNet v0.13.12, Windows 11 (10.0.22631.3007/23H2/2023Update/SunValley3)
AMD Ryzen 7 5800X, 1 CPU, 16 logical and 8 physical cores
.NET SDK 8.0.101
  [Host]     : .NET 8.0.1 (8.0.123.58001), X64 RyuJIT AVX2
  DefaultJob : .NET 8.0.1 (8.0.123.58001), X64 RyuJIT AVX2

Method	Length	Mean	Error	StdDev	Ratio	RatioSD	Code Size
CompareLists	0	8.447 ns	0.0409 ns	0.0342 ns	1.00	0.00	1,272 B
CompareListsNew	0	5.535 ns	0.0194 ns	0.0172 ns	0.66	0.00	1,676 B
CompareSequences	0	13.437 ns	0.2927 ns	0.6548 ns	1.62	0.07	1,272 B
CompareSequencesNew	0	11.315 ns	0.0310 ns	0.0290 ns	1.34	0.01	1,355 B

CompareLists	10	16.974 ns	0.0190 ns	0.0178 ns	1.00	0.00	1,545 B
CompareListsNew	10	5.979 ns	0.0101 ns	0.0090 ns	0.35	0.00	1,682 B
CompareSequences	10	21.504 ns	0.4550 ns	0.8546 ns	1.26	0.06	1,507 B
CompareSequencesNew	10	17.947 ns	0.1111 ns	0.0985 ns	1.06	0.01	1,720 B

CompareLists	100	96.927 ns	0.1403 ns	0.1312 ns	1.00	0.00	1,545 B
CompareListsNew	100	9.580 ns	0.0187 ns	0.0175 ns	0.10	0.00	1,682 B
CompareSequences	100	100.535 ns	1.4505 ns	1.3568 ns	1.04	0.01	1,507 B
CompareSequencesNew	100	96.647 ns	0.0974 ns	0.0911 ns	1.00	0.00	1,720 B

CompareLists	1000	1,060.270 ns	2.2337 ns	2.0894 ns	1.00	0.00	1,556 B
CompareListsNew	1000	59.631 ns	0.0964 ns	0.0855 ns	0.06	0.00	1,682 B
CompareSequences	1000	851.900 ns	1.0912 ns	0.9673 ns	0.80	0.00	1,519 B
CompareSequencesNew	1000	851.612 ns	0.7819 ns	0.6931 ns	0.80	0.00	1,732 B

(the non-ICollection<T> path is now consistently faster as well which does not make much sense as nothing really stands out in the codegen aside from branch ordering and larger stack frame for spans but I'll take it)

As for extending TryGetSpan with other types - I like ReadOnlyCollections and ImmutableArray<T> a lot but that's for @stephentoub to decide :)

Thanks.

eiriktsarpalis

Thanks

…ce comparison for arrays and lists (dotnet#97004)

ghost added the community-contribution Indicates that the PR has been added by a community member label Jan 15, 2024

dotnet-issue-labeler bot added the area-System.Linq label Jan 15, 2024

stephentoub reviewed Jan 15, 2024

View reviewed changes

src/libraries/System.Linq/src/System/Linq/SequenceEqual.cs Outdated Show resolved Hide resolved

stephentoub reviewed Jan 15, 2024

View reviewed changes

src/libraries/System.Linq/src/System/Linq/SequenceEqual.cs Outdated Show resolved Hide resolved

neon-sunset force-pushed the sequenceequal-list branch from f47fcd4 to 1ed8ff9 Compare January 15, 2024 22:47

neon-sunset force-pushed the sequenceequal-list branch from 1ed8ff9 to 98a4605 Compare January 15, 2024 22:57

build-analysis bot mentioned this pull request Jan 15, 2024

Checkout failure: "Git fetch failed with exit code 128" dotnet/arcade#9009

Open

2 tasks

eiriktsarpalis reviewed Jan 16, 2024

View reviewed changes

Use .TryGetSpan on sequences instead of type checks to forward sequen…

1aa701a

…ce comparison for arrays and lists

neon-sunset force-pushed the sequenceequal-list branch from b548e3b to 1aa701a Compare January 16, 2024 17:41

Merge branch 'main' into sequenceequal-list

281b136

build-analysis bot mentioned this pull request Jan 16, 2024

Tests crashing in CI with no dump: exit code 137 means SIGKILL Killed #97049

Closed

eiriktsarpalis approved these changes Jan 17, 2024

View reviewed changes

stephentoub approved these changes Jan 18, 2024

View reviewed changes

stephentoub merged commit 957ab2f into dotnet:main Jan 18, 2024
106 of 111 checks passed

neon-sunset deleted the sequenceequal-list branch January 18, 2024 03:40

tmds pushed a commit to tmds/runtime that referenced this pull request Jan 23, 2024

Use .TryGetSpan on sequences instead of type checks to forward sequen…

e6fba36

…ce comparison for arrays and lists (dotnet#97004)

radekdoulik mentioned this pull request Jan 31, 2024

[Perf] Linux/x64: 4 Regressions on 1/18/2024 3:40:02 AM dotnet/perf-autofiling-issues#27931

Open

github-actions bot locked and limited conversation to collaborators Feb 17, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Type test `List<T>` in Enumerable.SequenceEquals and forward to `MemoryExtensions.SequenceEqual(span...)` #97004

Type test `List<T>` in Enumerable.SequenceEquals and forward to `MemoryExtensions.SequenceEqual(span...)` #97004

neon-sunset commented Jan 15, 2024 •

edited

Loading

ghost commented Jan 15, 2024

stephentoub commented Jan 15, 2024

EgorBo commented Jan 15, 2024 •

edited

Loading

stephentoub commented Jan 16, 2024

EgorBo commented Jan 16, 2024 •

edited

Loading

stephentoub commented Jan 16, 2024

eiriktsarpalis left a comment

neon-sunset commented Jan 16, 2024

eiriktsarpalis left a comment

Type test List<T> in Enumerable.SequenceEquals and forward to MemoryExtensions.SequenceEqual(span...) #97004

Type test List<T> in Enumerable.SequenceEquals and forward to MemoryExtensions.SequenceEqual(span...) #97004

Conversation

neon-sunset commented Jan 15, 2024 • edited Loading

ghost commented Jan 15, 2024

stephentoub commented Jan 15, 2024

EgorBo commented Jan 15, 2024 • edited Loading

stephentoub commented Jan 16, 2024

EgorBo commented Jan 16, 2024 • edited Loading

stephentoub commented Jan 16, 2024

eiriktsarpalis left a comment

Choose a reason for hiding this comment

neon-sunset commented Jan 16, 2024

eiriktsarpalis left a comment

Choose a reason for hiding this comment

Type test `List<T>` in Enumerable.SequenceEquals and forward to `MemoryExtensions.SequenceEqual(span...)` #97004

Type test `List<T>` in Enumerable.SequenceEquals and forward to `MemoryExtensions.SequenceEqual(span...)` #97004

neon-sunset commented Jan 15, 2024 •

edited

Loading

EgorBo commented Jan 15, 2024 •

edited

Loading

EgorBo commented Jan 16, 2024 •

edited

Loading