x64: Implement SIMD `fma` #4474

afonso360 · 2022-07-20T11:38:20Z

👋 Hey,

Following the discussion in #4462, this PR adds a VEX encoder and lowering for SIMD fma with FMA extension instructions.

The VEX Encoder has an API similar to the EVEX encoder, however instead of writing the instruction as the caller is calling the fields, we store them all and later write the full instruction in encode.

This is because the VEX instruction format is much less predictable than EVEX and I couldn't find a way to cleanly write the bit fields, since at any time we can switch from 2 to 3 byte prefixes.

@abrown mentioned in #4462, there are potentially some improvements that we may want to do.

With that in place, we would probably need think through how to emit AVX instructions; e.g., something like Avx512Opcode but perhaps we want to be able to decide on the VEX/EVEX encoding at a later time (?)

Right now this implements AvxOpcode and always lowers it in VEX encoding. However I like the idea of choosing the encoding based on what is available and most efficient. Suggestions on how we would implement this would be appreciated!

Fixes #4462

This uses a similar builder pattern to the EVEX Encoder. Does not yet support memory accesses.

github-actions · 2022-07-20T11:43:42Z

Subscribe to Label Action

cc @fitzgen, @peterhuene

This issue or pull request has been labeled: "cranelift", "cranelift:area:x64", "cranelift:meta", "fuzzing", "wasmtime:api"

Thus the following users have been cc'd because of the following labels:

fitzgen: fuzzing
peterhuene: wasmtime:api

To subscribe or unsubscribe from this label, edit the .github/subscribe-to-label.json configuration file.

Learn more.

abrown

LGTM!

cranelift/codegen/src/isa/x64/inst.isle

cfallin

New 4-register form looks good except for the pretty-print, where I think a few of the args are unintentionally reordered.

cranelift/codegen/src/isa/x64/inst/mod.rs

afonso360 added 2 commits July 19, 2022 19:49

x64: Add VEX Instruction Encoder

4fc99d9

This uses a similar builder pattern to the EVEX Encoder. Does not yet support memory accesses.

x64: Add FMA Flag

f79671e

afonso360 force-pushed the x86-vex branch 2 times, most recently from c4cf522 to f9ef5bb Compare July 20, 2022 12:28

x64: Implement SIMD fma

10d2da2

afonso360 force-pushed the x86-vex branch from f9ef5bb to 10d2da2 Compare July 20, 2022 12:29

abrown approved these changes Jul 25, 2022

View reviewed changes

cranelift/codegen/src/isa/x64/inst.isle Outdated Show resolved Hide resolved

x64: Use 4 register Vex Inst

9f52094

afonso360 force-pushed the x86-vex branch from a61b6b0 to 9f52094 Compare July 25, 2022 19:25

cfallin reviewed Jul 25, 2022

View reviewed changes

cranelift/codegen/src/isa/x64/inst/mod.rs Show resolved Hide resolved

x64: Reorder VEX pretty print args

902ac6c

afonso360 force-pushed the x86-vex branch from d0fcc93 to 902ac6c Compare July 25, 2022 20:56

cfallin approved these changes Jul 25, 2022

View reviewed changes

cfallin enabled auto-merge (squash) July 25, 2022 21:30

cfallin merged commit 02c3b47 into bytecodealliance:main Jul 25, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

x64: Implement SIMD `fma` #4474

x64: Implement SIMD `fma` #4474

afonso360 commented Jul 20, 2022 •

edited

Loading

github-actions bot commented Jul 20, 2022

abrown left a comment

cfallin left a comment

x64: Implement SIMD fma #4474

x64: Implement SIMD fma #4474

Conversation

afonso360 commented Jul 20, 2022 • edited Loading

github-actions bot commented Jul 20, 2022

Subscribe to Label Action

abrown left a comment

Choose a reason for hiding this comment

cfallin left a comment

Choose a reason for hiding this comment

x64: Implement SIMD `fma` #4474

x64: Implement SIMD `fma` #4474

afonso360 commented Jul 20, 2022 •

edited

Loading