You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
This list is chapter 6 of the Arm C Language Extensions for SVE document, which are the required functions for SVE; there are also some optional functions for SVE, and required and optional functions for SVE2, so eventually there will be 4 issues total.
6.1 Introduction
6.2 Loads
6.2.1 LD1: Unextended load
6.2.2 LD1SB: Load 8-bit data and sign-extend
6.2.3 LD1UB: Load 8-bit data and zero-extend
6.2.4 LD1SH: Load 16-bit data and sign-extend
6.2.5 LD1UH: Load 16-bit data and zero-extend
6.2.6 LD1SW: Load 32-bit data and sign-extend
6.2.7 LD1UW: Load 32-bit data and zero-extend
6.2.8 LD1RQ: Unextended load and replicate to quadword
6.2.9 LDFF1: Unextended load, first-faulting
6.2.10 LDFF1SB: Load 8-bit data and sign-extend, first-faulting
6.2.11 LDFF1UB: Load 8-bit data and zero-extend, first-faulting
6.2.12 LDFF1SH: Load 16-bit data and sign-extend, first-faulting
6.2.13 LDFF1UH: Load 16-bit data and zero-extend, first-faulting
6.2.14 LDFF1SW: Load 32-bit data and sign-extend, first-faulting
6.2.15 LDFF1UW: Load 32-bit data and zero-extend, first-faulting
6.2.16 LDNF1: Unextended load, non-faulting
6.2.17 LDNF1SB: Load 8-bit data and sign-extend, non-faulting
6.2.18 LDNF1UB: Load 8-bit data and zero-extend, non-faulting
6.2.19 LDNF1SH: Load 16-bit data and sign-extend, non-faulting
6.2.20 LDNF1UH: Load 16-bit data and zero-extend, non-faulting
6.2.21 LDNF1SW: Load 32-bit data and sign-extend, non-faulting
6.2.22 LDNF1UW: Load 32-bit data and zero-extend, non-faulting
6.2.23 LDNT1: Unextended load, non-temporal
6.2.24 LD2: Load two-element structures into two vectors
6.2.25 LD3: Load three-element structures into three vectors
6.2.26 LD4: Load four-element structures into four vectors
6.3 Stores
6.3.1 ST1: Store one vector, with no truncation
6.3.2 ST1B: Store one vector, truncating to 8 bits
6.3.3 ST1H: Store one vector, truncating to 16 bits
6.3.4 ST1W: Store one vector, truncating to 32 bits
6.3.5 STNT1: Store one vector, with no truncation, non-temporal
6.3.6 ST2: Store two vectors into two-element structures
6.3.7 ST3: Store three vectors into three-element structures
6.3.8 ST4: Store four vectors into four-element structures
6.4 Prefetches
6.4.1 PRFB: Prefetch 8-bit data
6.4.2 PRFH: Prefetch 16-bit data
6.4.3 PRFW: Prefetch 32-bit data
6.4.4 PRFD: Prefetch 64-bit data
6.5 Address calculations
6.5.1 ADRB: Compute vector address for 8-bit data
6.5.2 ADRH: Compute vector address for 16-bit data
6.5.3 ADRW: Compute vector address for 32-bit data
6.5.4 ADRD: Compute vector address for 64-bit data
6.6 Scalar to vector operations
6.6.1 DUP: Duplicate scalar value (all done except _bf16 and _x (setting inactive to unknown) versions)
6.6.2 DUPQ: Duplicate scalars to every quadword of a vector
6.6.3 INDEX: Create index series
6.7 Integer arithmetic
6.7.1 ADD: Modular integer addition
6.7.2 QADD: Saturating integer addition
6.7.3 SUB: Modular integer subtraction (in progress, 40 of 60)
This list is chapter 6 of the Arm C Language Extensions for SVE document, which are the required functions for SVE; there are also some optional functions for SVE, and required and optional functions for SVE2, so eventually there will be 4 issues total.
_bf16
and_x
(setting inactive to unknown) versions)_bf6
and_b
versions)_bf16
versions; no direct tests)The text was updated successfully, but these errors were encountered: