Releases: vxst/float8_matmul

v0.0.1

24 May 14:20
v0.0.1 Pre-release

First Release of float8_matmul

This is the first release of the float8_matmul project, featuring basic matrix multiplication for 8-bit floating-point matrices.

Key Features:

  • Basic 8-bit float matrix multiplication (matmul).
  • Roughly 25% to 50% of native FP32 matmul performance on older CUDA devices.

Passed test:

When:

  • Matrix A: Row x0 filled with 1.0, all other entries 0.0.
  • Matrix B: Column y0 filled with 1.0, all other entries 0.0.

Expected Output:

  • Result at (x0, y0) equals the matrix dimension, since it is the sum of that many 1.0 × 1.0 products.
  • All other positions are 0.0.
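
The test case above can be sketched as a small reference check. This is an illustration only, not the project's actual test code: it uses a plain Python matmul as a stand-in for the float8 kernel, assumes square n x n matrices with zeros everywhere outside row x0 and column y0, and the function names (`build_case`, `reference_matmul`, `matches_expected`) are hypothetical.

```python
def build_case(n, x0, y0):
    """Build A (row x0 all 1.0) and B (column y0 all 1.0); zeros elsewhere."""
    a = [[1.0 if i == x0 else 0.0 for _ in range(n)] for i in range(n)]
    b = [[1.0 if j == y0 else 0.0 for j in range(n)] for _ in range(n)]
    return a, b


def reference_matmul(a, b):
    """Naive matmul in ordinary Python floats (stand-in for the float8 kernel)."""
    rows, inner, cols = len(a), len(b), len(b[0])
    return [
        [sum(a[i][p] * b[p][j] for p in range(inner)) for j in range(cols)]
        for i in range(rows)
    ]


def matches_expected(c, n, x0, y0):
    """True if c[x0][y0] == n and every other entry is 0.0."""
    for i in range(n):
        for j in range(n):
            expected = float(n) if (i, j) == (x0, y0) else 0.0
            if c[i][j] != expected:
                return False
    return True


if __name__ == "__main__":
    n, x0, y0 = 16, 3, 7  # example values chosen for illustration
    a, b = build_case(n, x0, y0)
    c = reference_matmul(a, b)
    assert matches_expected(c, n, x0, y0)
```

To check a real kernel against this pattern, the output of the float8 matmul would replace `reference_matmul(a, b)`, with the comparison loosened to a tolerance if the output format cannot represent the dimension exactly.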