Add all integer overloads for Avx2.BlendVariable #10679

fiigii · 2018-07-13T21:43:20Z

Now, we only have the byte/sbyte overload for Avx2.BlendVariable since the instruction uses the mask vector as a byte sequence.
However, Avx2.BlendVariable is very common to use, adding all integer overloads (short/ushort/int/uint/long/ulong) will significantly improve the user experience.

cc @tannergooding @CarolEidt @eerhardt

The text was updated successfully, but these errors were encountered:

saucecontrol · 2018-07-13T21:51:45Z

If you do it on Avx2.BlendVariable, I'd think it should be done on all the byte-shuffle ops. Ssse3.Shuffle would also need them, for example.

And there are less obvious ones like Ssse3.AlignRight.

Now that the StaticCast calls get folded, it's strictly a usability issue. Perhaps the answer is to improve the StaticCast to be more terse?

fiigii · 2018-07-13T21:55:34Z

If you do it on Avx2.BlendVariable, I'd think it should be done on all the byte-shuffle ops. Ssse3.Shuffle would also need them, for example.

Agree, I will investigate it and put here later. Thanks!

Perhaps the answer is to improve the StaticCast to be more terse?

Don't you mean eliminating StaticCast and using implicit conversion of Vector128/256 instead?

saucecontrol · 2018-07-13T22:01:24Z

Don't you mean eliminating StaticCast and using implicit conversion of Vector128/256 instead?

Yeah, that might be ideal. I remember seeing the discussions around when generics would be used and when the argument types would be exploded, and while the rules make a lot of sense, there's so much flexibility built into the way the Intel intrinsics are defined with __m128i and __m256i that it would mean a ton of overloads to cover everything people will be doing with these instructions.

tannergooding · 2018-07-13T22:04:57Z

Perhaps the answer is to improve the StaticCast to be more terse?

One of the proposals here was to partially explode the StaticCast methods. That is, where today you have Vector128<U> StaticCast<T, U>(Vector128<T>) you could instead do Vector128<T> StaticCast<T>(Vector128<float>) (and explode the other 9 types as well). This makes it significantly less verbose.

using implicit conversion of Vector128/256 instead?

I don't think implicit conversions are a good idea, and they aren't supported with some languages (such as F#).

tannergooding · 2018-07-13T22:06:06Z

As for the general proposal. I think it would depend on how confusing it is perceived to pass in a Vector128<int> but operate as if it was a Vector128<byte>.

saucecontrol · 2018-07-13T22:08:23Z

you could instead do Vector128<T> StaticCast<T>(Vector128<float>)

I like this idea a lot. It should totally be a thing.

Also, what was the call on the naming? I remember seeing a suggestion that it be called ReinterpretCast, but why not just Cast? That's what's used in System.Memory for reinterpreting Span<T>s

Also also, let's say that doesn't make it past API review... If I were to define my own partially exploded Cast, would that get inlined and then folded away same as the official one?

I think it would depend on how confusing it is perceived to pass in a Vector128<int> but operate as if it was a Vector128<byte>.

I think it goes back to the target audience for these. You have to assume the user knows what the instruction does, and if the instruction operates on the byte level, I'd know that if I were calling it, regardless of the argument types.

tannergooding · 2018-07-15T14:02:07Z

Also also, let's say that doesn't make it past API review... If I were to define my own partially exploded Cast, would that get inlined and then folded away same as the official one?

Provided it called the underlying StaticCast methods, it should.

4creators · 2018-07-18T10:00:04Z

One of the proposals here was to partially explode the StaticCast methods. That is, where today you have Vector128 StaticCast<T, U>(Vector128) you could instead do Vector128 StaticCast(Vector128) (and explode the other 9 types as well).

Proposal to simplify StaticCast<T, U> API is here: https://github.com/dotnet/corefx/issues/27911#issuecomment-372684004

tannergooding closed this as completed in dotnet/coreclr#19420 Sep 20, 2018

msftgits transferred this issue from dotnet/coreclr Jan 31, 2020

ghost locked as resolved and limited conversation to collaborators Dec 16, 2020

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add all integer overloads for Avx2.BlendVariable #10679

Add all integer overloads for Avx2.BlendVariable #10679

fiigii commented Jul 13, 2018

saucecontrol commented Jul 13, 2018

fiigii commented Jul 13, 2018

saucecontrol commented Jul 13, 2018 •

edited

Loading

tannergooding commented Jul 13, 2018

tannergooding commented Jul 13, 2018

saucecontrol commented Jul 13, 2018 •

edited

Loading

tannergooding commented Jul 15, 2018

4creators commented Jul 18, 2018

Add all integer overloads for Avx2.BlendVariable #10679

Add all integer overloads for Avx2.BlendVariable #10679

Comments

fiigii commented Jul 13, 2018

saucecontrol commented Jul 13, 2018

fiigii commented Jul 13, 2018

saucecontrol commented Jul 13, 2018 • edited Loading

tannergooding commented Jul 13, 2018

tannergooding commented Jul 13, 2018

saucecontrol commented Jul 13, 2018 • edited Loading

tannergooding commented Jul 15, 2018

4creators commented Jul 18, 2018

saucecontrol commented Jul 13, 2018 •

edited

Loading

saucecontrol commented Jul 13, 2018 •

edited

Loading