Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[LLVMGPU][ROCm] Plumb through i8, i8 -> i32 MFMA intrinsics #17764

Merged
merged 9 commits into from
Jun 28, 2024
Merged

Conversation

kuhar
Copy link
Member

@kuhar kuhar commented Jun 27, 2024

Add tests to make sure these are generated in the vector distribution pipeline. Add e2e correctness tests.

I also tested this manually on random inputs against golden outputs from numpy.

This contains one cherry-pick for llvm-project.

Copy link

google-cla bot commented Jun 27, 2024

Thanks for your pull request! It looks like this may be your first contribution to a Google open source project. Before we can look at your pull request, you'll need to sign a Contributor License Agreement (CLA).

View this failed invocation of the CLA check for more information.

For the most up to date status, view the checks section at the bottom of the pull request.

@kuhar kuhar enabled auto-merge (squash) June 28, 2024 14:35
@kuhar kuhar merged commit dcba7c5 into main Jun 28, 2024
53 of 54 checks passed
@kuhar kuhar deleted the shared/int8mfma branch June 28, 2024 14:58
LLITCHEV pushed a commit to LLITCHEV/iree that referenced this pull request Jul 30, 2024
…#17764)

Add tests to make sure these are generated in the vector distribution
pipeline. Add e2e correctness tests.

I also tested this manually on random inputs against golden outputs from
numpy.

This contains one cherry-pick for llvm-project.

---------

Co-authored-by: Stanley Winata <[email protected]>
Co-authored-by: Lei Zhang <[email protected]>
Signed-off-by: Lubo Litchev <[email protected]>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants