
Option to use a language model with Wav2Vec2 transcription #1312

Closed
lfcnassif opened this issue Sep 11, 2022 · 2 comments
Comments

@lfcnassif (Member)

This was left as a future improvement in #1214. This should be investigated: jonatasgrosman/huggingsound#62

@lfcnassif (Member, Author)

I still haven't been able to get the suggested KenshoLMDecoder implementation from the huggingsound library working properly to evaluate this. But I did manage to use the ParlanceLMDecoder implementation together with Jonatas Grosman's fine-tuned wav2vec2 large Portuguese model; the results were not good:

[attached screenshot: transcription results]
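For reference, the attempt above can be sketched roughly as follows, assuming huggingsound's documented API (`SpeechRecognitionModel`, `ParlanceLMDecoder`); the model name matches Jonatas Grosman's published checkpoint, but the LM path and decoder hyperparameters (`alpha`, `beta`, `beam_width`) are illustrative assumptions, not values from this issue:

```python
def transcribe_with_lm(audio_paths, lm_path):
    """Transcribe audio files with wav2vec2 plus an n-gram LM shallow-fusion decoder.

    audio_paths: list of paths to audio files.
    lm_path: path to a KenLM binary language model (illustrative; not from the issue).
    """
    # Deferred import so the sketch only needs huggingsound at call time.
    from huggingsound import SpeechRecognitionModel, ParlanceLMDecoder

    # Jonatas Grosman's fine-tuned wav2vec2 large Portuguese model.
    model = SpeechRecognitionModel(
        "jonatasgrosman/wav2vec2-large-xlsr-53-portuguese"
    )

    # ParlanceLMDecoder rescores CTC beams with the external LM; alpha/beta
    # weight the LM score and word insertion bonus (values here are guesses).
    decoder = ParlanceLMDecoder(
        model.token_set,
        lm_path=lm_path,
        alpha=2.0,
        beta=1.0,
        beam_width=100,
    )

    return model.transcribe(audio_paths, decoder=decoder)
```

Tuning `alpha` and `beta` on a held-out set usually matters a lot for LM-fused CTC decoding, so poor results may partly reflect untuned weights rather than the approach itself.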

@lfcnassif lfcnassif removed their assignment Mar 27, 2024
@lfcnassif (Member, Author)

Since we are migrating to Whisper, I'm closing this.
