Add PrecisionFromFloat32() to return precision without performing conversion #3

x448 · 2020-01-05T05:51:53Z

It would be useful to know the precision of converting IEEE binary32 to binary16, if the function can be inlined.

PrecisionFromFloat32 should return Precision without performing the conversion.
Conversions from both Infinity and NaN values will always report PrecisionExact even
if NaN payload or NaN-Quiet-Bit is lost.

If this is too complex to be inlined by Go, then make it an extra return value as part of conversion functions.

// Precision indicates whether the conversion to Float16 is
// exact, inexact, underflow, or overflow.
type Precision int

const (
       PrecisionExact Precision = iota
       PrecisionInexact
       PrecisionUnderflow
       PrecisionOverflow
)

func PrecisionFromfloat32(f32 float32) Precision

The text was updated successfully, but these errors were encountered:

x448 closed this as completed in fc4a616 Jan 5, 2020

x448 mentioned this issue Jan 5, 2020

Add Fromfloat32ex returning both Float16 and the precision of conversion. #2

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add PrecisionFromFloat32() to return precision without performing conversion #3

Add PrecisionFromFloat32() to return precision without performing conversion #3

x448 commented Jan 5, 2020

Add PrecisionFromFloat32() to return precision without performing conversion #3

Add PrecisionFromFloat32() to return precision without performing conversion #3

Comments

x448 commented Jan 5, 2020