Improve vectorization of IndexOf(chars, StringComparison.OrdinalIgnoreCase) #85437

stephentoub · 2023-04-27T02:18:07Z

Use the same general "Algorithm 1: Generic SIMD" that we do for StringComparison.Ordinal, adapted for OrdinalIgnoreCase.

private static readonly string s_haystack = new HttpClient().GetStringAsync("https://www.gutenberg.org/files/1661/1661-0.txt").Result;

[Params("watson", "elementary", "holmes", "the")]
public string Needle { get; set; }

[Benchmark]
public int Count()
{
    int count = 0;
    ReadOnlySpan<char> haystack = s_haystack;
    while (true)
    {
        int pos = haystack.IndexOf(Needle, StringComparison.OrdinalIgnoreCase);
        if (pos < 0) break;
        count++;
        haystack = haystack.Slice(pos + Needle.Length);
    }
    return count;
}

Method	Toolchain	Needle	Mean	Error	StdDev	Ratio
Count	\main\corerun.exe	elementary	580.54 us	3.562 us	2.781 us	1.00
Count	\pr\corerun.exe	elementary	59.37 us	0.447 us	0.397 us	0.10

Count	\main\corerun.exe	holmes	366.58 us	0.607 us	0.568 us	1.00
Count	\pr\corerun.exe	holmes	82.55 us	0.204 us	0.181 us	0.23

Count	\main\corerun.exe	the	547.91 us	1.257 us	1.050 us	1.00
Count	\pr\corerun.exe	the	258.62 us	1.123 us	0.996 us	0.47

Count	\main\corerun.exe	watson	230.76 us	0.550 us	0.514 us	1.00
Count	\pr\corerun.exe	watson	58.24 us	0.448 us	0.419 us	0.25

ghost · 2023-04-27T02:18:16Z

Tagging subscribers to this area: @dotnet/area-system-runtime
See info in area-owners.md if you want to be subscribed.

Issue Details

Use the same general "Algorithm 1: Generic SIMD" that we do for StringComparison.Ordinal, adapter for OrdinalIgnoreCase.

[Params("watson", "elementary", "holmes", "the")]
public string Needle { get; set; }

[Benchmark]
public int Count()
{
    int count = 0;
    ReadOnlySpan<char> haystack = s_haystack;
    while (true)
    {
        int pos = haystack.IndexOf(Needle, StringComparison.OrdinalIgnoreCase);
        if (pos < 0) break;
        count++;
        haystack = haystack.Slice(pos + Needle.Length);
    }
    return count;
}

Method	Toolchain	Needle	Mean	Error	StdDev	Ratio
Count	\main\corerun.exe	elementary	580.54 us	3.562 us	2.781 us	1.00
Count	\pr\corerun.exe	elementary	59.37 us	0.447 us	0.397 us	0.10

Count	\main\corerun.exe	holmes	366.58 us	0.607 us	0.568 us	1.00
Count	\pr\corerun.exe	holmes	82.55 us	0.204 us	0.181 us	0.23

Count	\main\corerun.exe	the	547.91 us	1.257 us	1.050 us	1.00
Count	\pr\corerun.exe	the	258.62 us	1.123 us	0.996 us	0.47

Count	\main\corerun.exe	watson	230.76 us	0.550 us	0.514 us	1.00
Count	\pr\corerun.exe	watson	58.24 us	0.448 us	0.419 us	0.25

Author:	stephentoub
Assignees:	-
Labels:	`area-System.Runtime`, `tenet-performance`
Milestone:	8.0.0

…eCase) Use the same general "Algorithm 1: Generic SIMD" that we do for StringComparison.Ordinal, adapter for OrdinalIgnoreCase.

EgorBo · 2023-05-01T14:14:57Z

src/libraries/System.Private.CoreLib/src/System/Globalization/Ordinal.cs

+                    // Load a vector from the current search space offset and another from the offset plus the distance between the two characters.
+                    // For each, | with 0x20 so that letters are lowercased, then & those together to get a mask. If the mask is all zeros, there
+                    // was no match.  If it wasn't, we have to do more work to check for a match.
+                    Vector128<ushort> cmpCh2 = Vector128.Equals(ch2, Vector128.BitwiseOr(Vector128.LoadUnsafe(ref searchSpace, (nuint)(offset + ch1ch2Distance)), Vector128.Create((ushort)0x20)));


Very nit: Vector128.BitwiseOr -> |

This is just style, right? Happy to change it, just questioning whether it's worth rerunning ci.

definitely not worth it 🙂

Right its "just style". There is also the general considerations of "methods" vs "operators" (such as precedence and readability) but we're not super consistent today just due to the operators being relatively new.

EgorBo

Nice! Assuming we're fine with the overhead for the worst case - it's slightly bigger in case of OrdinalIgnoreCase due to more work + the path to find unique chars is more expensive, but the benefits should outweight that 👍

stephentoub · 2023-05-01T14:36:30Z

I think it's worth it. It's more expensive to set up than ordinal, but the match validation that happens on every potential match is also more expensive, and this generally lessens the latter. It will regress in cases similar to ordinal regressed, eg where the starting character never matches, but on the balance I expect it'll be a meaningful win. Let's try and see what falls out. :-)

stephentoub added area-System.Runtime tenet-performance Performance related issue labels Apr 27, 2023

stephentoub added this to the 8.0.0 milestone Apr 27, 2023

stephentoub requested review from EgorBo and tannergooding April 27, 2023 02:18

ghost assigned stephentoub Apr 27, 2023

stephentoub mentioned this pull request Apr 27, 2023

Enable regex to use IndexOf(..., OrdinalIgnoreCase) for prefix searching #85438

Merged

dotnet deleted a comment Apr 27, 2023

Improve vectorization of IndexOf(chars, StringComparison.OrdinalIgnor…

fe53637

…eCase) Use the same general "Algorithm 1: Generic SIMD" that we do for StringComparison.Ordinal, adapter for OrdinalIgnoreCase.

stephentoub force-pushed the vectorordinalignorecase branch from 3fb61ee to fe53637 Compare April 27, 2023 20:29

Fix duplicate local

b2c58d1

stephentoub requested a review from MihaZupan April 28, 2023 15:00

Merge branch 'dotnet:main' into vectorordinalignorecase

66bf449

EgorBo reviewed May 1, 2023

View reviewed changes

EgorBo approved these changes May 1, 2023

View reviewed changes

stephentoub merged commit 80cd72e into dotnet:main May 1, 2023

stephentoub deleted the vectorordinalignorecase branch May 1, 2023 14:36

cincuranet mentioned this pull request May 4, 2023

Regressions in System.Memory.ReadOnlySpan.IndexOfString #85756

Closed

This was referenced May 4, 2023

[Perf] Linux/x64: 2 Regressions on 5/1/2023 3:42:23 PM dotnet/perf-autofiling-issues#17237

Closed

[Perf] Linux/x64: 3 Regressions on 5/1/2023 3:42:23 PM dotnet/perf-autofiling-issues#17208

Closed

This was referenced May 4, 2023

[Perf] Windows/x64: 67 Improvements on 4/25/2023 9:59:52 PM dotnet/perf-autofiling-issues#17240

Closed

[Perf] Linux/x64: 1 Improvement on 5/1/2023 3:42:23 PM dotnet/perf-autofiling-issues#17588

Closed

This was referenced May 10, 2023

[Perf] Linux/arm64: 8 Improvements on 5/2/2023 12:34:35 AM dotnet/perf-autofiling-issues#17517

Closed

[Perf] Linux/x64: 3 Regressions on 5/1/2023 3:42:23 PM dotnet/perf-autofiling-issues#17505

Closed

This was referenced May 10, 2023

[Perf] Linux/arm64: 5 Improvements on 5/1/2023 10:26:58 PM dotnet/perf-autofiling-issues#17513

Closed

[Perf] Linux/x64: 4 Regressions on 5/1/2023 6:56:14 PM dotnet/perf-autofiling-issues#17553

Closed

kunalspathak mentioned this pull request May 16, 2023

[Perf] Windows/x64: 1 Improvement on 5/1/2023 3:42:23 PM dotnet/perf-autofiling-issues#17618

Closed

ghost locked as resolved and limited conversation to collaborators May 31, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Improve vectorization of IndexOf(chars, StringComparison.OrdinalIgnoreCase) #85437

Improve vectorization of IndexOf(chars, StringComparison.OrdinalIgnoreCase) #85437

stephentoub commented Apr 27, 2023 •

edited

Loading

ghost commented Apr 27, 2023

EgorBo May 1, 2023

stephentoub May 1, 2023

EgorBo May 1, 2023

tannergooding May 1, 2023

EgorBo left a comment

stephentoub commented May 1, 2023

Improve vectorization of IndexOf(chars, StringComparison.OrdinalIgnoreCase) #85437

Improve vectorization of IndexOf(chars, StringComparison.OrdinalIgnoreCase) #85437

Conversation

stephentoub commented Apr 27, 2023 • edited Loading

ghost commented Apr 27, 2023

EgorBo May 1, 2023

Choose a reason for hiding this comment

stephentoub May 1, 2023

Choose a reason for hiding this comment

EgorBo May 1, 2023

Choose a reason for hiding this comment

tannergooding May 1, 2023

Choose a reason for hiding this comment

EgorBo left a comment

Choose a reason for hiding this comment

stephentoub commented May 1, 2023

stephentoub commented Apr 27, 2023 •

edited

Loading