Lowering Vector512() methods : comparison + shift #84942

DeepakRajendrakumaran · 2023-04-17T19:12:50Z

Includes following

Comparison : LessThan(), LessThanOrEqual(), GreaterThan(), GreaterThanOrEqual() +corresponding *ANY(), *ALL() and ConditionalSelect()

Arithmetic : ShiftLeft, ShiftRight, ShiftRightArithmetic

Open : Some cases for ConditionalSelect() uses blend. Skipping for now. Have some other thoughts here anyway : using VPTERNLOG()??

@dotnet/avx512-contrib

src/coreclr/jit/gentree.cpp

DeepakRajendrakumaran · 2023-04-17T23:27:36Z

@tannergooding @BruceForstall

BruceForstall · 2023-04-17T23:30:16Z

cc @dotnet/jit-contrib

tannergooding · 2023-04-18T23:01:47Z

There's some merge conflicts that need to be resolved, let me know if you need any assistance with them.

DeepakRajendrakumaran · 2023-04-19T00:53:34Z

There's some merge conflicts that need to be resolved, let me know if you need any assistance with them.

Have resolved them

src/coreclr/jit/hwintrinsiclistxarch.h

src/coreclr/jit/gentree.cpp

src/coreclr/jit/hwintrinsicxarch.cpp

…corresponding *ANY(), *ALL() and ConditionaSelect() ShiftLeft, ShiftRight, ShiftRightArithmetic

tannergooding · 2023-04-19T18:37:36Z

src/coreclr/jit/gentree.cpp

+                // TODO-XArch-CQ: It's a non-trivial amount of work to support these
+                // for floating-point while only utilizing AVX. It would require, among
+                // other things, inverting the comparison and potentially support for a
+                // new Avx.TestNotZ intrinsic to ensure the codegen remains efficient.
+                assert(compIsaSupportedDebugOnly(InstructionSet_AVX2));
+                intrinsic = NI_Vector256_op_Equality;


For other reviewers, this is just a copy/paste of an existing comment that used to be shared across the total of GE/GT/LE/LT

It's actually a lot easier for us to handle this today if we wanted to and is maybe a small/easy win we can do for .NET 8 (of course in a separate PR), so having the logic duplicated now will make doing that a bit simpler

BruceForstall · 2023-04-19T22:42:46Z

Manually triggered replay to try to get past infra issue: https://dev.azure.com/dnceng-public/public/_build/results?buildId=245191&view=results

DeepakRajendrakumaran · 2023-04-20T17:17:46Z

Manually triggered replay to try to get past infra issue: https://dev.azure.com/dnceng-public/public/_build/results?buildId=245191&view=results

Thanks @BruceForstall .Looks like it passed on rerun. Do let me know if any other changes are needed or this is good to go.

kunalspathak · 2023-04-20T21:51:40Z

src/coreclr/jit/gentree.cpp

-                intrinsic = NI_Vector512_op_Equality;
-            }
-            else if (simdSize == 32)
+            if (simdSize == 32)


just curious, why we decided to check for 32 , then 64 and then (implicit) 16 or less? Is it because Vector256 is more common and should hit that condition first from TP perspective?

Yep. Initially when we had throughput issues, this was one of the things we tried. Making check for 32 the first(being the most common case)

src/coreclr/jit/lowerxarch.cpp

kunalspathak

A suggestion and a question. Feel free to do it in follow-up PR or including it with the next PR.

Co-authored-by: Kunal Pathak <[email protected]>

tannergooding · 2023-04-20T22:29:59Z

Resolved conflict with multiply pr

BruceForstall · 2023-04-20T23:13:06Z

Resolved conflict with #85070

dotnet-issue-labeler bot added the area-CodeGen-coreclr CLR JIT compiler in src/coreclr/src/jit and related components such as SuperPMI label Apr 17, 2023

DeepakRajendrakumaran force-pushed the Deepak_comparison branch from 7cde562 to 34fb88b Compare April 17, 2023 19:41

jnyrup reviewed Apr 17, 2023

View reviewed changes

src/coreclr/jit/gentree.cpp Outdated Show resolved Hide resolved

DeepakRajendrakumaran force-pushed the Deepak_comparison branch from 34fb88b to 00ced07 Compare April 17, 2023 20:35

BruceForstall added the avx512 Related to the AVX-512 architecture label Apr 17, 2023

DeepakRajendrakumaran force-pushed the Deepak_comparison branch from 00ced07 to 8b7ec63 Compare April 17, 2023 23:26

DeepakRajendrakumaran marked this pull request as ready for review April 17, 2023 23:27

BruceForstall requested a review from tannergooding April 17, 2023 23:30

This was referenced Apr 18, 2023

Tracking issue for CI build timeouts #76454

Closed

nativeaot/SmokeTests/DwarfDump failing on linux-x64 Debug #84979

Closed

DeepakRajendrakumaran force-pushed the Deepak_comparison branch from 8b7ec63 to 8eed177 Compare April 19, 2023 00:52

tannergooding reviewed Apr 19, 2023

View reviewed changes

src/coreclr/jit/hwintrinsiclistxarch.h Outdated Show resolved Hide resolved

tannergooding reviewed Apr 19, 2023

View reviewed changes

src/coreclr/jit/gentree.cpp Show resolved Hide resolved

tannergooding reviewed Apr 19, 2023

View reviewed changes

src/coreclr/jit/hwintrinsicxarch.cpp Outdated Show resolved Hide resolved

DeepakRajendrakumaran force-pushed the Deepak_comparison branch 3 times, most recently from ad69641 to 46b5933 Compare April 19, 2023 18:24

LessThan(), LessThanOrEqual(), GreaterThan(), GreaterThanOrEqual() + …

46b5933

…corresponding *ANY(), *ALL() and ConditionaSelect() ShiftLeft, ShiftRight, ShiftRightArithmetic

tannergooding reviewed Apr 19, 2023

View reviewed changes

tannergooding approved these changes Apr 19, 2023

View reviewed changes

kunalspathak reviewed Apr 20, 2023

View reviewed changes

src/coreclr/jit/lowerxarch.cpp Show resolved Hide resolved

kunalspathak approved these changes Apr 20, 2023

View reviewed changes

Update src/coreclr/jit/lowerxarch.cpp : review comment

af69e9f

Co-authored-by: Kunal Pathak <[email protected]>

Merge branch 'main' into Deepak_comparison

d25c47b

Merge branch 'main' into Deepak_comparison

b64d797

tannergooding merged commit f8d1116 into dotnet:main Apr 21, 2023

ghost locked as resolved and limited conversation to collaborators May 21, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Lowering Vector512() methods : comparison + shift #84942

Lowering Vector512() methods : comparison + shift #84942

DeepakRajendrakumaran commented Apr 17, 2023 •

edited

Loading

DeepakRajendrakumaran commented Apr 17, 2023

BruceForstall commented Apr 17, 2023

tannergooding commented Apr 18, 2023

DeepakRajendrakumaran commented Apr 19, 2023

tannergooding Apr 19, 2023

BruceForstall commented Apr 19, 2023

DeepakRajendrakumaran commented Apr 20, 2023

kunalspathak Apr 20, 2023

DeepakRajendrakumaran Apr 20, 2023

kunalspathak left a comment

tannergooding commented Apr 20, 2023 •

edited

Loading

BruceForstall commented Apr 20, 2023 •

edited

Loading

Lowering Vector512() methods : comparison + shift #84942

Lowering Vector512() methods : comparison + shift #84942

Conversation

DeepakRajendrakumaran commented Apr 17, 2023 • edited Loading

DeepakRajendrakumaran commented Apr 17, 2023

BruceForstall commented Apr 17, 2023

tannergooding commented Apr 18, 2023

DeepakRajendrakumaran commented Apr 19, 2023

tannergooding Apr 19, 2023

Choose a reason for hiding this comment

BruceForstall commented Apr 19, 2023

DeepakRajendrakumaran commented Apr 20, 2023

kunalspathak Apr 20, 2023

Choose a reason for hiding this comment

DeepakRajendrakumaran Apr 20, 2023

Choose a reason for hiding this comment

kunalspathak left a comment

Choose a reason for hiding this comment

tannergooding commented Apr 20, 2023 • edited Loading

BruceForstall commented Apr 20, 2023 • edited Loading

DeepakRajendrakumaran commented Apr 17, 2023 •

edited

Loading

tannergooding commented Apr 20, 2023 •

edited

Loading

BruceForstall commented Apr 20, 2023 •

edited

Loading