Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

JIT ARM64-SVE: Implement IF_SVE_FD_3A, IF_SVE_FD_3B, IF_SVE_FD_3C #97934

Merged
merged 2 commits into from
Feb 5, 2024

Conversation

amanasifkhalid
Copy link
Member

Part of #94549.

cstool output:

mul   z0.h, z1.h, z1.h[1]
mul   z2.h, z3.h, z3.h[3]
mul   z4.h, z5.h, z5.h[5]
mul   z6.h, z7.h, z7.h[7]
mul   z8.s, z9.s, z1.s[0]
mul   z10.s, z11.s, z3.s[1]
mul   z12.s, z13.s, z5.s[2]
mul   z14.s, z15.s, z7.s[3]
mul   z16.d, z17.d, z0.d[0]
mul   z18.d, z19.d, z5.d[1]
mul   z20.d, z21.d, z10.d[0]
mul   z22.d, z23.d, z15.d[1]

JitDisasm output:

mul     z0.h, z1.h, z1.h[1]
mul     z2.h, z3.h, z3.h[3]
mul     z4.h, z5.h, z5.h[5]
mul     z6.h, z7.h, z7.h[7]
mul     z8.s, z9.s, z1.s[0]
mul     z10.s, z11.s, z3.s[1]
mul     z12.s, z13.s, z5.s[2]
mul     z14.s, z15.s, z7.s[3]
mul     z16.d, z17.d, z0.d[0]
mul     z18.d, z19.d, z5.d[1]
mul     z20.d, z21.d, z10.d[0]
mul     z22.d, z23.d, z15.d[1]

cc @dotnet/arm64-contrib.

@amanasifkhalid amanasifkhalid added the arm-sve Work related to arm64 SVE/SVE2 support label Feb 4, 2024
@dotnet-issue-labeler dotnet-issue-labeler bot added the area-CodeGen-coreclr CLR JIT compiler in src/coreclr/src/jit and related components such as SuperPMI label Feb 4, 2024
@ghost ghost assigned amanasifkhalid Feb 4, 2024
@ghost
Copy link

ghost commented Feb 4, 2024

Tagging subscribers to this area: @JulieLeeMSFT, @jakobbotsch
See info in area-owners.md if you want to be subscribed.

Issue Details

Part of #94549.

cstool output:

mul   z0.h, z1.h, z1.h[1]
mul   z2.h, z3.h, z3.h[3]
mul   z4.h, z5.h, z5.h[5]
mul   z6.h, z7.h, z7.h[7]
mul   z8.s, z9.s, z1.s[0]
mul   z10.s, z11.s, z3.s[1]
mul   z12.s, z13.s, z5.s[2]
mul   z14.s, z15.s, z7.s[3]
mul   z16.d, z17.d, z0.d[0]
mul   z18.d, z19.d, z5.d[1]
mul   z20.d, z21.d, z10.d[0]
mul   z22.d, z23.d, z15.d[1]

JitDisasm output:

mul     z0.h, z1.h, z1.h[1]
mul     z2.h, z3.h, z3.h[3]
mul     z4.h, z5.h, z5.h[5]
mul     z6.h, z7.h, z7.h[7]
mul     z8.s, z9.s, z1.s[0]
mul     z10.s, z11.s, z3.s[1]
mul     z12.s, z13.s, z5.s[2]
mul     z14.s, z15.s, z7.s[3]
mul     z16.d, z17.d, z0.d[0]
mul     z18.d, z19.d, z5.d[1]
mul     z20.d, z21.d, z10.d[0]
mul     z22.d, z23.d, z15.d[1]

cc @dotnet/arm64-contrib.

Author: amanasifkhalid
Assignees: amanasifkhalid
Labels:

area-CodeGen-coreclr, arch-arm64-sve

Milestone: -

@ryujit-bot
Copy link

Diff results for #97934

Throughput diffs

Throughput diffs for linux/arm64 ran on windows/x64

MinOpts (-0.01% to +0.00%)
Collection PDIFF
libraries.pmi.linux.arm64.checked.mch -0.01%

Details here


@ryujit-bot
Copy link

Diff results for #97934

Throughput diffs

Throughput diffs for linux/arm64 ran on linux/x64

Overall (-0.01% to -0.00%)
Collection PDIFF
libraries_tests_no_tiered_compilation.run.linux.arm64.Release.mch -0.01%
libraries_tests.run.linux.arm64.Release.mch -0.01%
libraries.pmi.linux.arm64.checked.mch -0.01%
benchmarks.run_tiered.linux.arm64.checked.mch -0.01%
benchmarks.run_pgo.linux.arm64.checked.mch -0.01%
benchmarks.run.linux.arm64.checked.mch -0.01%
realworld.run.linux.arm64.checked.mch -0.01%
coreclr_tests.run.linux.arm64.checked.mch -0.01%
libraries.crossgen2.linux.arm64.checked.mch -0.01%
MinOpts (-0.02% to -0.00%)
Collection PDIFF
libraries_tests_no_tiered_compilation.run.linux.arm64.Release.mch -0.01%
libraries_tests.run.linux.arm64.Release.mch -0.01%
libraries.pmi.linux.arm64.checked.mch -0.01%
benchmarks.run_tiered.linux.arm64.checked.mch -0.01%
benchmarks.run_pgo.linux.arm64.checked.mch -0.01%
benchmarks.run.linux.arm64.checked.mch -0.01%
realworld.run.linux.arm64.checked.mch -0.02%
coreclr_tests.run.linux.arm64.checked.mch -0.01%
FullOpts (-0.01% to -0.00%)
Collection PDIFF
libraries_tests_no_tiered_compilation.run.linux.arm64.Release.mch -0.01%
libraries.pmi.linux.arm64.checked.mch -0.01%
benchmarks.run_tiered.linux.arm64.checked.mch -0.01%
benchmarks.run_pgo.linux.arm64.checked.mch -0.01%
benchmarks.run.linux.arm64.checked.mch -0.01%
realworld.run.linux.arm64.checked.mch -0.01%
coreclr_tests.run.linux.arm64.checked.mch -0.01%
libraries.crossgen2.linux.arm64.checked.mch -0.01%

Details here


Copy link
Member

@kunalspathak kunalspathak left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM

@amanasifkhalid amanasifkhalid merged commit 9d7c48e into dotnet:main Feb 5, 2024
6 of 16 checks passed
@amanasifkhalid amanasifkhalid deleted the sve-mul branch February 5, 2024 22:11
@github-actions github-actions bot locked and limited conversation to collaborators Mar 7, 2024
Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.
Labels
area-CodeGen-coreclr CLR JIT compiler in src/coreclr/src/jit and related components such as SuperPMI arm-sve Work related to arm64 SVE/SVE2 support
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants