Support multilingual whisper models #274

csukuangfj · 2023-08-15T14:43:11Z

Note you can use

--whisper-language

to specify the spoken language in the input audio file or leave it empty to let the code infer the language from the input file

and

--whisper-task=transcribe or --whisper-task=translate

to do transcribe or translate.

csukuangfj · 2023-08-16T02:41:16Z

FYI:
There is a huggingface space to show Next-gen Kaldi + Whisper models for speech recognition. Please see
https://huggingface.co/spaces/k2-fsa/automatic-speech-recognition-with-whisper

pukatana · 2023-12-23T03:13:28Z

--whisper-language can be used on Android ASR?

csukuangfj · 2023-12-23T04:40:37Z

--whisper-language can be used on Android ASR?

You can use it in the code. It is not exposed to users via the UI.

pukatana · 2023-12-23T06:01:11Z

--whisper-language can be used on Android ASR?

You can use it in the code. It is not exposed to users via the UI.
Hi @csukuangfj ,
Where can I use that? I couldn't find the code in JNI and android.
Could you give me some tips?

csukuangfj · 2023-12-23T06:08:28Z

--whisper-language can be used on Android ASR?

You can use it in the code. It is not exposed to users via the UI.
Hi @csukuangfj ,
Where can I use that? I couldn't find the code in JNI and android.
Could you give me some tips?

You can follow the way about how to add encoder and decoder to add the language option.

pukatana · 2023-12-23T07:29:14Z

@csukuangfj , Thanks for your kind reply.
I found the options, but some problems to build the JNI.
If you have time, please update the JNI libs with language options.

csukuangfj · 2023-12-23T07:30:12Z

please show error logs of your problem.

pukatana · 2023-12-23T07:37:16Z

I'm trying to build the project under the company's proxy.
So, there is a simple error.
error: downloading 'https://github.com/kkm000/openfst/archive/refs/tags/win/1.6.5.1.tar.gz' failed when build the project on Android Studio.
I'd be grateful if you provide the updated libs.

pukatana · 2023-12-23T08:05:56Z

When I try using downloaded files, I got this error.

FAILED: openfst-populate-prefix/src/openfst-populate-stamp/openfst-populate-patch D:/shepra/sherpa-onnx-master/android/SherpaOnnxVadAsr/app/.cxx/Debug/2g631h5d/x86/_deps/openfst-subbuild/openfst-populate-prefix/src/openfst-populate-stamp/openfst-populate-patch
cmd.exe /C "cd /D D:\shepra\sherpa-onnx-master\android\SherpaOnnxVadAsr\app.cxx\Debug\2g631h5d\x86_deps\openfst-src && sed -i.bak s/enable_testing()//g src/CMakeLists.txt && sed -i.bak s/add_subdirectory(test)//g src/CMakeLists.txt && sed -i.bak /message/d src/script/CMakeLists.txt && D:\android\sdk\cmake\3.22.1\bin\cmake.exe -E touch D:/shepra/sherpa-onnx-master/android/SherpaOnnxVadAsr/app/.cxx/Debug/2g631h5d/x86/_deps/openfst-subbuild/openfst-populate-prefix/src/openfst-populate-stamp/openfst-populate-patch"
'sed' is not recognized as an internal or external command,
operable program or batch file.
ninja: build stopped: subcommand failed.

csukuangfj · 2023-12-23T15:40:00Z

When I try using downloaded files, I got this error.

FAILED: openfst-populate-prefix/src/openfst-populate-stamp/openfst-populate-patch D:/shepra/sherpa-onnx-master/android/SherpaOnnxVadAsr/app/.cxx/Debug/2g631h5d/x86/_deps/openfst-subbuild/openfst-populate-prefix/src/openfst-populate-stamp/openfst-populate-patch cmd.exe /C "cd /D D:\shepra\sherpa-onnx-master\android\SherpaOnnxVadAsr\app.cxx\Debug\2g631h5d\x86_deps\openfst-src && sed -i.bak s/enable_testing()//g src/CMakeLists.txt && sed -i.bak s/add_subdirectory(test)//g src/CMakeLists.txt && sed -i.bak /message/d src/script/CMakeLists.txt && D:\android\sdk\cmake\3.22.1\bin\cmake.exe -E touch D:/shepra/sherpa-onnx-master/android/SherpaOnnxVadAsr/app/.cxx/Debug/2g631h5d/x86/_deps/openfst-subbuild/openfst-populate-prefix/src/openfst-populate-stamp/openfst-populate-patch" 'sed' is not recognized as an internal or external command, operable program or batch file. ninja: build stopped: subcommand failed.

Our doc is for Linux and macOS.

If you really want to use Windows, please refer to the following colab notebook
https://github.com/k2-fsa/colab/blob/master/sherpa-onnx/build_sherpa_onnx_for_android.ipynb
to generate the required libraries and then use them in Android Studio.

pukatana · 2023-12-24T02:20:29Z

Should I build so files on Linux not using Android Studio?

csukuangfj · 2023-12-24T04:44:23Z

Should I build so files on Linux not using Android Studio?

Yes, yor are right.

But Android Studio is needed if you need to build APKs.

pukatana · 2023-12-24T04:46:37Z

Ok, I see.
I only found the language option when initialize the model config.
And is it possible to use --language option when decode the multilingual whisper model?

csukuangfj · 2023-12-24T05:36:16Z

Ok, I see. I only found the language option when initialize the model config. And is it possible to use --language option when decode the multilingual whisper model?

Yes, it is possible. If you provide --whisper-language="", i.e., if you don't specify --whisper-language at all and use its default value, then it will detect the language in the audio automatically.
If you want to specify the language at the decoding time, you have to change the code. Fortunately, it is just a tiny change.

The related code is at

sherpa-onnx/sherpa-onnx/csrc/offline-whisper-greedy-search-decoder.cc

Line 79 in e475e75

if (!config_.language.empty()) {

Instead of reading the language from the config, you can pass it as a function argument.

csukuangfj added 8 commits August 15, 2023 17:02

Fix go-api-examples and support multilingual whisper models.

6768936

support multilingual whisper models

963b299

Fix Python examples

89692c5

small fixes

3e5f4bf

Release v1.7.7

11bee47

Fix kotlin API example

243bd5c

fix style issues

aa076de

small fixes

6cc87c0

csukuangfj merged commit f709c95 into k2-fsa:master Aug 15, 2023
132 of 142 checks passed

csukuangfj deleted the whisper-multilingual branch August 15, 2023 16:29

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Support multilingual whisper models #274

Support multilingual whisper models #274

csukuangfj commented Aug 15, 2023

csukuangfj commented Aug 16, 2023

pukatana commented Dec 23, 2023

csukuangfj commented Dec 23, 2023

pukatana commented Dec 23, 2023

csukuangfj commented Dec 23, 2023

pukatana commented Dec 23, 2023

csukuangfj commented Dec 23, 2023

pukatana commented Dec 23, 2023 •

edited

Loading

pukatana commented Dec 23, 2023

csukuangfj commented Dec 23, 2023

pukatana commented Dec 24, 2023

csukuangfj commented Dec 24, 2023 •

edited

Loading

pukatana commented Dec 24, 2023 •

edited

Loading

csukuangfj commented Dec 24, 2023

Support multilingual whisper models #274

Support multilingual whisper models #274

Conversation

csukuangfj commented Aug 15, 2023

csukuangfj commented Aug 16, 2023

pukatana commented Dec 23, 2023

csukuangfj commented Dec 23, 2023

pukatana commented Dec 23, 2023

csukuangfj commented Dec 23, 2023

pukatana commented Dec 23, 2023

csukuangfj commented Dec 23, 2023

pukatana commented Dec 23, 2023 • edited Loading

pukatana commented Dec 23, 2023

csukuangfj commented Dec 23, 2023

pukatana commented Dec 24, 2023

csukuangfj commented Dec 24, 2023 • edited Loading

pukatana commented Dec 24, 2023 • edited Loading

csukuangfj commented Dec 24, 2023

pukatana commented Dec 23, 2023 •

edited

Loading

csukuangfj commented Dec 24, 2023 •

edited

Loading

pukatana commented Dec 24, 2023 •

edited

Loading