Adding temperature scaling on Joiner logits: #789
Conversation
- T hard-coded to 2.0
- so far the best result is NCE 0.122 (still not very high)
- the BPE scores were rescaled with 0.2 (but then incorrect words also get high confidence; visually reasonable histograms come from a 0.5 scale)
- BPE->WORD score merging is done by the min(.) function (also tried prob-product, and the arithmetic, geometric, and harmonic means)
- without temperature scaling (i.e. scale 1.0), the best NCE was 0.032 (there, product merging was best)

Results seem consistent with: https://arxiv.org/abs/2110.15222

Everything was tuned on a very small set of 100 sentences with 813 words and 10.2% WER, a Czech model.

I also experimented with blank posteriors mixed into the BPE confidences, but found no NCE improvement, so I am not pushing that.

Temperature scaling was also added to the greedy-search confidences.
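For illustration, here is a minimal sketch of the two ideas above: temperature-scaled softmax confidences per BPE token, and min() merging of piece confidences into a word confidence. It is not the PR's actual code; `TokenConfidence` and `WordConfidence` are hypothetical helper names.

```c++
#include <algorithm>
#include <cmath>
#include <cstdint>
#include <vector>

// Softmax of (logits / T); the probability of the emitted token is taken as
// its confidence.  T > 1 flattens the distribution (T = 1 means no scaling).
// Assumes logits is non-empty and token is a valid index.
float TokenConfidence(const std::vector<float> &logits, int32_t token,
                      float temperature) {
  float max_logit = *std::max_element(logits.begin(), logits.end());
  float denom = 0.0f;
  for (float l : logits) {
    denom += std::exp((l - max_logit) / temperature);
  }
  return std::exp((logits[token] - max_logit) / temperature) / denom;
}

// Word confidence = min over the confidences of its BPE pieces, the merging
// rule that gave the best NCE in the experiments above.
float WordConfidence(const std::vector<float> &piece_confidences) {
  return *std::min_element(piece_confidences.begin(),
                           piece_confidences.end());
}
```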
```c++
// copy raw logits, apply temperature-scaling (for confidences)
int32_t p_logit_items = vocab_size * num_hyps;
std::vector<float> logit_with_temperature(p_logit_items);
```
Can we change p_logit in-place?
No, it cannot be done in place.
The idea is to apply the temperature only for the computation of confidences;
the decoding continues to use the original values.
This is why the logit values are copied into a new buffer.
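A rough sketch of that separation, using the variable names from the snippet above (the helper itself is illustrative, not the actual PR code):

```c++
#include <cstdint>
#include <vector>

// Copy the raw logits into a separate buffer scaled by 1/T.  The returned
// buffer is used only for the confidence computation; decoding continues to
// read the original p_logit values untouched.
std::vector<float> CopyLogitsWithTemperature(const float *p_logit,
                                             int32_t p_logit_items,
                                             float temperature) {
  std::vector<float> logit_with_temperature(p_logit_items);
  for (int32_t i = 0; i != p_logit_items; ++i) {
    logit_with_temperature[i] = p_logit[i] / temperature;
  }
  return logit_with_temperature;
}
```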
Thanks for the explanation.
Thanks!
Could you make it configurable and give it a default value of 1.0 (like what we are doing for the blank penalty)?
Okay, working on it. I am not sure about the default of 1.0; for 1.0 the confidences have worse quality than for 2.0.
1.0 is for backward compatibility.
In that case, 2.0 is fine with me.
Okay, the T parameter is now configurable.
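A hedged sketch of what the configurable option could look like, mirroring the blank-penalty style of knob mentioned above (the struct and field names here are hypothetical, not the actual sherpa API):

```c++
// Hypothetical config sketch: the temperature only affects the confidence
// computation; the default follows the value agreed on in this thread.
struct BeamSearchConfig {
  float blank_penalty = 0.0f;      // existing knob, shown for comparison
  float temperature_scale = 2.0f;  // T applied to joiner logits for confidences
};
```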
There seems to be some problem with the workflow tests; many fail with a 503 error pointing to a huggingface URL. Otherwise, it should be ready for a code review (tested with a local client, and it works as expected).
Force-pushed from ce6e5b5 to 8b57f73.
I fixed an error in the Android build.
The tests look OK; I am seeing only unrelated errors.
Thank you for your contribution!
T is configurable.

- so far the best result is NCE 0.122 (still not very high)
- without temperature scaling (i.e. scale 1.0), the best NCE was 0.032 (there, product merging was the best)

Results seem consistent with: https://arxiv.org/abs/2110.15222

Everything was tuned on a very small set of 100 sentences with 813 words and 10.2% WER, a Czech model.

I also experimented with blank posteriors mixed into the BPE confidences, but found no NCE improvement, so I am not pushing that.

Temperature scaling was also added to the greedy-search confidences.