Remove shift of signed int in src_float_to_short_array #86

Flamefire · 2019-08-09T17:27:31Z

- Scale by -SHORT_MIN
- Clip if the float is outside the range of short
- Otherwise round and truncate (guaranteed to fit in a short)

Uses float instead of double, since a float can represent every 16-bit value exactly.

A better alternative to approaches like #85. Also adds tests that show where e.g. janstary@ec07760 would fail.

Omits the clipping optimization, which is normally disabled on x64 because long is 64 bits wide there.

Flamefire · 2019-08-11T09:31:41Z

Once this is merged: would it make sense to do the same for src_float_to_int_array? It would not save an instruction when the clipping optimization is disabled, but it would still look cleaner and more readable, and it would avoid another configure check.

Side note: unfortunately, using lrint is a pessimization on even remotely modern CPUs:

- lrint may have to report overflow (range) errors by setting errno: http://www.cplusplus.com/reference/cmath/lrint/
- Because of that, without -fno-math-errno lrint is not inlined but compiled to a library call, which makes it slow.
- "Modern" CPUs support the truncating float/double->int conversion in a single instruction.

Documentation: http://www.cplusplus.com/reference/cmath/lrint/
Experiment: https://godbolt.org/z/l77Mvt

erikd · 2019-08-11T09:55:31Z

How do we get the "truncating rounding conversion of float/double->int in a single instruction"?

Flamefire · 2019-08-11T10:32:48Z

Simply use a plain cast. See the godbolt link for a comparison of the resulting assembly.

erikd · 2019-08-11T14:39:05Z

Does a plain cast have the same rounding behavior as lrint? If it does not, it's not a replacement.

Flamefire · 2019-08-11T15:48:51Z

Yes and no. A cast truncates, which is the same as "round toward zero". So if the rounding mode is set to FE_TOWARDZERO, lrint does the same. Otherwise, "round to nearest" can be emulated by (x>0) ? x+.5f : x-.5f; which clang optimizes to a branch-less version on x86_64: https://godbolt.org/z/leAb8H

Note, however, that lrint itself does not guarantee "round to nearest": it follows whatever rounding mode is currently set.

A good read with comparisons on different archs: https://stackoverflow.com/a/37624488/1930508
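A small sketch of the cast-based emulation described in this comment, using a hypothetical helper name. One assumption worth stating: this form rounds ties away from zero, while lrintf in the default rounding mode rounds ties to even, so the results can differ on exact halves (and on a few values just below 0.5, where adding 0.5f already rounds up).

```c
#include <math.h>

/* Hypothetical helper: emulate round-to-nearest with a truncating cast.
   Ties are rounded away from zero. The caller must make sure the result
   fits in an int. */
static int round_nearest_by_cast(float x)
{
    return (int) ((x > 0) ? x + 0.5f : x - 0.5f);
}
```

For example, 2.5f becomes 3 here, whereas lrintf(2.5f) yields 2 under the default round-to-nearest-even mode.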

Flamefire added 2 commits August 9, 2019 18:50

Remove shift of signed int in src_float_to_short_array

feb069a

Add corner cases to tests

ed155a1

erikd approved these changes Aug 11, 2019

erikd merged commit 9225471 into libsndfile:master Aug 11, 2019

Flamefire deleted the shift_fix branch August 11, 2019 08:49
