Noise Shaping #8

phoboslab · 2023-02-07T13:38:16Z

I've added some very simple noise shaping to the encoder (to the noise_shaping branch). This does not change the decoder or the data format. The noise shaping should help to move quantization noise into the higher, less audible frequencies.

Here's a comparison page with all samples with and without noise shaping: https://phoboslab.org/files/qoa-samples/noiseshaping.html

The difference for some sample is night & day. Listen to 32_triangles-triangle_roll_stereo at 00:43 or 35_glockenspiel_arpegio_melodious_phrase_stereo at 00:39.

However, this noise shaping has an adverse effect for some other samples. I tried to contain it by only applying most of the shaping when our prediction is "bad" anyway. But still, I feel that some samples sound more "crunchy" now. Listen to 21_trumpet_arpegio_melodious_phrase_stereo right at the beginning for instance. Vocals in julien_baker_sprained_ankle and others also seem to have lost a bit of "smoothness".

Maybe someone with better ears (and/or equipment :D) can take a listen? What's the usual strategy here, to adaptively correct for quantization noise?

The text was updated successfully, but these errors were encountered:

mattdesl · 2023-02-07T18:22:44Z

Maybe something that could be optional or configurable? Some of the audio files I’m thinking of using QOA for have a lot of noise to begin with that I would like to keep more or less intact, almost treating it like data rather than audible signal. But perhaps I misunderstand what this shaping does.

I can’t hear the difference in your example page with my cheapo headphones but in Audacity comparing files with Invert it becomes more noticeable. Will need to do some more tests.

phoboslab · 2023-02-08T21:07:21Z

This should only affect the quantization noise that is added by the encoder; it won't remove any noise that is present in the source. But yes, making it optional is certainly the right idea!

p0nce · 2023-04-16T15:53:16Z

audio-formats has TDPF (courtesy of MIT-licensed Airwindows, tuned and modified by me to fit WAV) dithering in its QOA encoder now: https://github.com/AuburnSounds/audio-formats/blob/master/source/audioformats/qoa.d#L724
It was tuned for WAV. imo It's more important to get dithering levels right rather than get the best dithering. I will finetune the dither level for QOA encoding. (EDIT: errr, disabled for now, it sounds worse than without dithering)

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Noise Shaping #8

Noise Shaping #8

phoboslab commented Feb 7, 2023 •

edited

Loading

mattdesl commented Feb 7, 2023 •

edited

Loading

phoboslab commented Feb 8, 2023

p0nce commented Apr 16, 2023 •

edited

Loading

Noise Shaping #8

Noise Shaping #8

Comments

phoboslab commented Feb 7, 2023 • edited Loading

mattdesl commented Feb 7, 2023 • edited Loading

phoboslab commented Feb 8, 2023

p0nce commented Apr 16, 2023 • edited Loading

phoboslab commented Feb 7, 2023 •

edited

Loading

mattdesl commented Feb 7, 2023 •

edited

Loading

p0nce commented Apr 16, 2023 •

edited

Loading