Thanks for submitting the feature request. This sounds like a reasonable request, but it requires careful design.
As a starting point, can you describe how you would feed in a batched tensor?
For example, the shape of the input tensor (what each dimension represents) and how the function signature would change (if a change is required).
I also encountered this problem with torchaudio.compliance.kaldi.fbank. I hope torchaudio can add batch processing, for example by fixing the input to three dimensions, [batch, channel, samples], or by adding a batch_first option. When working on ASR or KWS the batch size is usually very large, so if only one audio clip can be processed at a time, won't efficiency suffer?
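To make the proposed [batch, channel, samples] layout concrete, here is a minimal sketch of a wrapper, not an existing torchaudio API. The name `batched_kaldi_features` and the `feature_fn` parameter are hypothetical; the sketch assumes `feature_fn` takes a (channel, samples) tensor and returns a (frames, feature_dim) tensor, which is how the current per-utterance functions behave.

```python
import torch


def batched_kaldi_features(waveforms, feature_fn, **kwargs):
    """Apply a per-utterance feature function to every item in a batch.

    waveforms: tensor of shape (batch, channel, samples).
    feature_fn: hypothetical parameter; any callable that takes a
        (channel, samples) tensor and returns a (frames, feature_dim) tensor.
    Returns a tensor of shape (batch, frames, feature_dim).
    """
    if waveforms.dim() != 3:
        raise ValueError(
            f"expected (batch, channel, samples), got shape {tuple(waveforms.shape)}"
        )
    # Every item has the same number of samples, so every output has the
    # same number of frames and the results can be stacked directly.
    features = [feature_fn(waveform, **kwargs) for waveform in waveforms]
    return torch.stack(features, dim=0)
```

With torchaudio installed, this could be called as `batched_kaldi_features(batch, torchaudio.compliance.kaldi.fbank, num_mel_bins=80)`. Note that it only illustrates the interface: it still loops over the batch in Python, so it does not give the speedup the request is after. A real batched implementation would need to vectorize the framing and filterbank steps internally.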
🚀 Feature
A batch dimension should be supported for the Kaldi-compliant functions, for example torchaudio.compliance.kaldi.fbank.
Motivation
Running the computation on the GPU in batches is essential.