steps/make_mfcc_pitch.sh --nj 64 --mfcc-config conf/mfcc_hires.conf --cmd run.pl data/test
utils/validate_data_dir.sh: Successfully validated data-directory data/test
steps/make_mfcc_pitch.sh: [info]: no segments file exists: assuming wav.scp indexed by utterance.
steps/make_mfcc_pitch.sh: It seems not all of the feature files were successfully procesed (4379 != 4596); consider using utils/fix_data_dir.sh data/test
steps/make_mfcc_pitch.sh: Succeeded creating MFCC and pitch features for test
steps/compute_cmvn_stats.sh data/test
Succeeded creating CMVN stats for test
fix_data_dir.sh: kept 4379 utterances out of 4596
fix_data_dir.sh: old files are kept in data/test/.backup
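The log above reports 4379 feature files for 4596 utterances, but not which utterances failed. A minimal sketch for listing them, assuming the pre-fix `wav.scp` was saved under `data/test/.backup` (as the last log line suggests) and that the utterance ID is the first field of both files; the function name `list_missing_utts` is hypothetical:

```shell
# Print utterance IDs that appear in the first scp file but not in the second.
# Usage: list_missing_utts <all-utterances.scp> <produced-features.scp>
list_missing_utts() {
  # Read the feature list first and remember its IDs, then print every ID
  # from the full list that was not seen. Comparing FILENAME (rather than
  # NR==FNR) keeps this correct even if the feature list is empty.
  awk 'FILENAME == ARGV[1] { seen[$1] = 1; next }
       !($1 in seen)       { print $1 }' "$2" "$1"
}
```

For the run above, this might be called as `list_missing_utts data/test/.backup/wav.scp data/test/feats.scp`, which should list the utterances that `fix_data_dir.sh` dropped.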
Every time I change the value of --nj, the number of feature files that are successfully generated changes as well; different --nj values give different counts. Why is this? The audio files are not corrupted.
That command would have created log files containing warnings about any problems. Use
find . -name '*.log' -mtime -2 -print
as an example command to find such files.
I don't know how you think it's possible that Kaldi would treat recordings of ethnic minorities differently than those of the ethnic majority. How would it know? These features just relate to the frequency spectrum.
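The log search suggested above can be taken one step further by printing only the lines that report problems. This is a rough sketch, assuming the Kaldi binaries tag such lines with `WARNING` or `ERROR`; the function name `scan_recent_logs` is hypothetical:

```shell
# Print WARNING/ERROR lines (prefixed with the file name) from all *.log
# files under a directory that were modified within the last two days.
# Usage: scan_recent_logs [dir]   (dir defaults to the current directory)
scan_recent_logs() {
  find "${1:-.}" -name '*.log' -mtime -2 \
    -exec grep -H -E 'WARNING|ERROR' {} +
}
```

Running `scan_recent_logs .` from the directory where the feature extraction was launched should point at the utterances that the extraction jobs complained about, if the assumption about the log format holds.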
Hello, thank you for answering my question. My problem is now solved. It was not a problem with Kaldi, nor with the pronunciation quality of the ethnic-minority speakers. Two of my audio files (two words) were faulty, and they caused the problem. After fixing them, all features were extracted successfully.
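The thread does not say what was wrong with the two audio files. As one possible first check, the sketch below flags entries whose audio file is missing or empty. It assumes each `wav.scp` line has the form `<utt-id> <path>` with no spaces in the path, and it skips piped entries; the function name `find_bad_wavs` is hypothetical:

```shell
# Print utterance IDs from a wav.scp whose audio file is missing or empty.
# Usage: find_bad_wavs <wav.scp>
find_bad_wavs() {
  while read -r utt path rest; do
    # Skip blank lines.
    [ -z "$utt" ] && continue
    # Skip entries that are shell pipelines (they contain a "|").
    case "$path $rest" in *'|'*) continue ;; esac
    # -s is true only if the file exists and has a size greater than zero.
    [ -s "$path" ] || echo "$utt"
  done < "$1"
}
```

This only catches missing or zero-length files; a file with a damaged header or an unexpected format would still need to be found through the extraction logs.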
------------------ Original message ------------------
From: "Daniel ***@***.***>;
Sent: Friday, November 10, 2023, 4:29 PM
To: ***@***.***>;
Cc: ***@***.***>; ***@***.***>;
Subject: Re: [kaldi-asr/kaldi] Does Kaldi fail to extract features from the non-standard pronunciation of ethnic minorities? (Issue #4871)