support batching for kaldi compliant feature extraction functions #675

saurabh-kataria · 2020-06-01T21:03:17Z

🚀 Feature

batch dimension should be supported for kaldi complaint functions, for example, in torchaudio.compliance.kaldi.fbank

Motivation

Computation on GPU and use batches is essential

mthrok · 2020-06-01T21:34:58Z

Thanks for submitting the feature request. This sounds reasonable request, but requires careful design.
As a starting point, can you describe how you would feed batched tensor?
like shape of the input tensor (what dimensions they represent) and how the function signature would change (if change is required).

echocatzh · 2020-09-14T06:16:35Z

I also encountered this problem with compliance.kaldi.fbank. I hope torchaudio can add batch processing operations, such as limiting the input dimension to 3 dimensions, [batch, channel, samples], or adding a batch_first option, because when working on asr or kws, the batchsize is usually very Large, if only one audio can be processed at a time, will the efficiency be reduced?

mthrok added Kaldi enhancement labels Jun 1, 2020

mthrok mentioned this issue Feb 15, 2021

RFC: The future of Kaldi compliance module #1269

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

support batching for kaldi compliant feature extraction functions #675

support batching for kaldi compliant feature extraction functions #675

saurabh-kataria commented Jun 1, 2020

mthrok commented Jun 1, 2020

echocatzh commented Sep 14, 2020 •

edited

Loading

support batching for kaldi compliant feature extraction functions #675

support batching for kaldi compliant feature extraction functions #675

Comments

saurabh-kataria commented Jun 1, 2020

🚀 Feature

Motivation

mthrok commented Jun 1, 2020

echocatzh commented Sep 14, 2020 • edited Loading

echocatzh commented Sep 14, 2020 •

edited

Loading