The differences between versions funasr-1.x.x and funasr-0.x.x #1319
For me, the previous Mandarin-only model (aishell2-vocab5212) was better for non-standard Mandarin. It is also 4-5x faster during inference. Example from the AISHELL-3 test set: SSB06930005.txt (please rename the file to .wav to listen). I forced all models to produce Chinese by setting decoder_out = -∞ for non-Hanzi tokens.
However, the old pipeline method does not work anymore. Therefore, I have mapped the configs over: Just unzip and copy it into the |
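The "decoder_out = -∞ for non-Hanzi tokens" trick mentioned above can be sketched roughly as follows. This is a toy, pure-Python illustration and not the code used in the comparison: the vocabulary, the scores, and the helper names `is_hanzi` and `mask_non_hanzi` are hypothetical, and "Hanzi" is assumed to mean the CJK Unified Ideographs block (U+4E00 to U+9FFF).

```python
import math


def is_hanzi(token: str) -> bool:
    """True if the token is non-empty and every character is in U+4E00..U+9FFF."""
    return len(token) > 0 and all("\u4e00" <= ch <= "\u9fff" for ch in token)


def mask_non_hanzi(scores, vocab):
    """Return a copy of `scores` with -inf at every index whose token is not Hanzi."""
    return [s if is_hanzi(tok) else -math.inf for s, tok in zip(scores, vocab)]


# Toy vocabulary and decoder scores for a single step (hypothetical values).
vocab = ["<blank>", "the", "你", "好", "a"]
scores = [0.9, 2.5, 1.0, 0.3, 3.0]

masked = mask_non_hanzi(scores, vocab)
best = max(range(len(masked)), key=masked.__getitem__)
print(vocab[best])  # the highest-scoring token that survives the mask
```

Note that this naive mask also removes special tokens such as `<blank>`; a real decoder would probably need to keep its end-of-sequence token unmasked.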
FunASR
To run without errors, the versions of `modelscope`, `funasr`, and the model params should be as follows:
We recommend using `AutoModel` (recommended):
More examples can be found in the docs.
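As a rough sketch of the recommended `AutoModel` usage (assuming funasr>=1.0.3 is installed): the model name `paraformer-zh`, the revision `v2.0.4`, and the wrapper function `transcribe` are assumptions for illustration and are not taken from this post. The import is done lazily inside the function so the snippet can be loaded even where funasr is missing.

```python
def transcribe(wav_path, model_name="paraformer-zh", model_revision="v2.0.4"):
    """Run ASR on one audio file with funasr's AutoModel (funasr>=1.0.3 assumed)."""
    # Imported lazily so this module can be loaded without funasr installed.
    from funasr import AutoModel

    # `model` may be a model name from the model zoo or a local path.
    model = AutoModel(model=model_name, model_revision=model_revision)

    # In the latest version the audio is passed as `input`.
    return model.generate(input=wav_path)
```

Calling `transcribe("example.wav")` (a hypothetical file name) would download the model params on first use if `model_name` is not a local path.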
If you still want to use the `modelscope` pipeline (legacy, not recommended):
The old version is no longer maintained.
3.1. In the latest version of `funasr>=1.0.3` and `modelscope>=1.11.1`, you can download the model params in either of the following ways:
a. Automatic download by funasr (default):
(Notes: Both the latest and the old versions support this. In the latest version (funasr>=1.0.3), you should add the `model_revision`. You must not add it in the old version (funasr-0.8.8); otherwise it raises errors.)
When you run the code above, it checks whether `model` is a local path or a model name.
If `model` is a local path, the download is skipped.
If `model` is a model name from the model zoo, the model params are downloaded automatically from the zoo.
b. `git clone` manually (only in the latest version):
Notes: Only use `git clone` in the latest version (funasr>=1.0.3). If your version is funasr-0.8.3, it raises errors.
You can download the model params with `git clone`, for example:
Then you can set `model`
to the local path you downloaded.
3.2. In the latest version of `funasr>=1.0.3` and `modelscope>=1.11.1`, the input name is `input`.
But in the old version, the input name is `audio_in`.
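If one script has to run against both versions, the keyword name could be picked from the installed funasr version. This is only a sketch: `version_tuple` and `audio_kwargs` are hypothetical helpers, and treating everything below 1.0.3 as "old" is an assumption, since the post only describes funasr>=1.0.3 and funasr-0.x.x.

```python
def version_tuple(version):
    """Parse 'X.Y.Z' into a 3-tuple of ints, ignoring any non-numeric suffix."""
    parts = []
    for piece in version.split(".")[:3]:
        digits = ""
        for ch in piece:
            if not ch.isdigit():
                break
            digits += ch
        parts.append(int(digits or 0))
    while len(parts) < 3:  # pad short versions such as "1.0"
        parts.append(0)
    return tuple(parts)


def audio_kwargs(audio, funasr_version):
    """Return {'input': audio} for funasr>=1.0.3, else {'audio_in': audio}."""
    name = "input" if version_tuple(funasr_version) >= (1, 0, 3) else "audio_in"
    return {name: audio}
```

The returned dict can then be unpacked into the call, for example `model.generate(**audio_kwargs(wav, version))` in the latest version.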
3.3. In the latest version of `funasr>=1.0.3` and `modelscope>=1.11.1`, the output result is a `list`.
But in the old version, the output result is a `dict`.
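Downstream code that should not care which version produced the result could normalize both shapes to a list. A minimal sketch, assuming each list element is itself a dict; the helper name `normalize_result` and the field names used in the usage note below are hypothetical, since the post only states `list` versus `dict`.

```python
def normalize_result(result):
    """Return a list of result dicts regardless of which version produced `result`."""
    if isinstance(result, list):
        return result      # latest version: already a list
    if isinstance(result, dict):
        return [result]    # old version: wrap the single dict
    raise TypeError(f"unexpected result type: {type(result).__name__}")
```

For example, `normalize_result({"text": "..."})` gives a one-element list, so the same loop over results works in both cases.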
3.4. In the latest version of `funasr>=1.0.3` and `modelscope>=1.11.1`, `batch_size` works as follows:
If you run inference without `vad_model`, `batch_size` refers to the number of audio files.
(Notes: both the latest and the old versions support this.)
If you run inference with `vad_model`, `batch_size_s` refers to the total duration of the audio in seconds (s).
But in the old version, it is `batch_size_token`.
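To make the two units concrete, here is a toy illustration of batching by file count versus batching by total duration. It only shows what the numbers mean; it is not funasr's actual batching code, and `batch_by_count` and `batch_by_duration` are hypothetical helpers.

```python
def batch_by_count(files, batch_size):
    """Group files into batches of at most `batch_size` items (no VAD case)."""
    return [files[i:i + batch_size] for i in range(0, len(files), batch_size)]


def batch_by_duration(segments, batch_size_s):
    """Greedily group (name, duration_s) pairs so each batch totals <= batch_size_s.

    A single segment longer than the budget gets a batch of its own.
    """
    batches, current, total = [], [], 0.0
    for name, duration in segments:
        if current and total + duration > batch_size_s:
            batches.append(current)
            current, total = [], 0.0
        current.append(name)
        total += duration
    if current:
        batches.append(current)
    return batches


print(batch_by_count(["a.wav", "b.wav", "c.wav"], 2))
print(batch_by_duration([("s1", 10), ("s2", 25), ("s3", 20), ("s4", 5)], 30))
```

With a 30-second budget, the four segments above end up in three batches, whereas a count-based `batch_size` ignores how long each file is.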