Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

NEON: implement all bf16-related intrinsics #1110

Merged
merged 11 commits into from
Nov 20, 2023
Merged

Conversation

yyctw
Copy link
Contributor

@yyctw yyctw commented Nov 17, 2023

Hi all, this is Eric from Andes Technology Corporation. This PR includes the following:

Added bf16 type to SIMDe.
Added the +bf16 option to two cross files: aarch64-clang-15-ccache.cross and aarch64-gcc-12-ccache.cross.
Implemented all bf16 type related intrinsics along with test cases.
Added 133 initial implementations and corresponding test cases in 47 families which are listed below:

  • combine, copy_lane, create, cvt, dot, dot_lane, dup_lane, dup_n, fmlal, get_high,
  • get_lane, get_low, ld1, ld1_dup, ld1_lane, ld1_x2, ld1_x3, ld1_x4, ld1q_x2, ld1q_x3,
  • ld1q_x4, ld2, ld2_dup, ld2_lane, ld3, ld3_dup, ld3_lane, ld4, ld4_dup, ld4_lane,
  • mmlaq, reinterpret, set_lane, st1, st1_lane, st1_x2, st1_x3, st1_x4, st1q_x2, st1q_x3,
  • st1q_x4, st2, st2_lane, st3, st3_lane, st4, st4_lane.

Thanks for reading and any recommendations are welcome🎉🎉🎉!

Copy link
Collaborator

@mr-c mr-c left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Thank you!

simde/arm/neon/dot.h Outdated Show resolved Hide resolved
simde/arm/neon/reinterpret.h Show resolved Hide resolved
simde/arm/neon/reinterpret.h Show resolved Hide resolved
simde/simde-bf16.h Outdated Show resolved Hide resolved
simde/simde-bf16.h Outdated Show resolved Hide resolved
simde/arm/neon/dup_n.h Outdated Show resolved Hide resolved
simde/arm/neon/dup_n.h Outdated Show resolved Hide resolved
simde/arm/neon/reinterpret.h Show resolved Hide resolved
simde/arm/neon/reinterpret.h Show resolved Hide resolved
test/arm/neon/dup_lane.c Outdated Show resolved Hide resolved
@yyctw
Copy link
Contributor Author

yyctw commented Nov 20, 2023

All comments have been completed.

@mr-c mr-c merged commit c59db7c into simd-everywhere:master Nov 20, 2023
72 of 75 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants