Add Kaldi Pitch feature #1243
Conversation
Force-pushed from d9ebbd3 to 80f234d.
Force-pushed from ae26d83 to e1ca9ee.
auto mat = M.tensor_;
if (trans == kNoTrans) {
  tensor_ =
      beta * tensor_ + torch::diag(torch::mm(mat, mat.transpose(1, 0)));
If this is called in a tight loop with small Tensors, you might fare better inlining the operation and using the underlying tensor data pointer instead of calling into torch::mm repeatedly.
Also note that torch::mm parallelizes internally, but at::parallel_for disables nested parallelism for OpenMP (PyTorch's default threadpool) to avoid oversubscription.
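A minimal sketch of this suggestion, assuming a contiguous 2-D float32 matrix; the function name is illustrative, not code from the PR. It computes diag(M·Mᵀ) as row-wise dot products directly on the raw data pointer:

```cpp
#include <torch/torch.h>

// Row-wise squared norms, i.e. diag(mat * mat^T), computed without
// torch::mm by walking the raw data pointer. Assumes float32 input.
torch::Tensor row_sq_norms(const torch::Tensor& mat) {
  auto m = mat.contiguous();               // guarantee row-major layout
  const int64_t rows = m.size(0);
  const int64_t cols = m.size(1);
  const float* data = m.data_ptr<float>();
  auto out = torch::empty({rows}, m.options());
  float* out_data = out.data_ptr<float>();
  for (int64_t r = 0; r < rows; ++r) {
    const float* row = data + r * cols;
    float acc = 0.f;
    for (int64_t c = 0; c < cols; ++c) {
      acc += row[c] * row[c];              // diag entry r = dot(row_r, row_r)
    }
    out_data[r] = acc;
  }
  return out;
}
```

For small matrices this avoids the per-call dispatch and threading overhead of invoking torch::mm inside a tight loop.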
Force-pushed from e1ca9ee to b158e7c.
This PR adds the Kaldi Pitch feature detailed in "A pitch extraction algorithm tuned for automatic speech recognition". The interface is mostly the same as the compute-kaldi-pitch-feats CLI. Batch support is added via the at::parallel_for function (see the sketch below). As the function binds the custom-built libkaldi, it only supports CPU (and float32) at the moment.
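A minimal sketch of this batching pattern, under the assumption that the per-utterance extractor is a standalone function; compute_pitch_single and compute_pitch_batch are hypothetical names, not the PR's actual API:

```cpp
#include <ATen/Parallel.h>
#include <torch/torch.h>
#include <vector>

// Hypothetical stand-in for the single-utterance Kaldi pitch extractor.
torch::Tensor compute_pitch_single(const torch::Tensor& wave) {
  return wave.unsqueeze(0);  // placeholder result
}

// Run the extractor over a batch of waveforms in parallel.
std::vector<torch::Tensor> compute_pitch_batch(const torch::Tensor& waves) {
  const int64_t batch = waves.size(0);
  std::vector<torch::Tensor> results(batch);
  // Grain size 1: let ATen decide how to split the batch across threads.
  at::parallel_for(0, batch, 1, [&](int64_t begin, int64_t end) {
    for (int64_t i = begin; i < end; ++i) {
      results[i] = compute_pitch_single(waves[i]);
    }
  });
  return results;
}
```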
About the custom-built Kaldi
Since Kaldi is a large library, I chose to use only a subset of it by adding a custom build process. In addition, to reduce dependencies and simplify the build process, I reused the BLAS package that PyTorch uses. For this, I added a custom interface to Kaldi's matrix libraries, so the algorithm runs on the torch::Tensor class. However, some parts of the algorithm require direct memory access, so the resulting function is not differentiable. The resulting code is also very slow (roughly 60x) at the moment due to the overhead of slicing operations: Kaldi's feature implementations work element-wise, while PyTorch is faster when operations are vectorized (see the illustration below).
Supersedes #1063
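To make the slicing-overhead point concrete, here is an assumed illustration (not code from the PR): each per-element access dispatches an operator and materializes a 0-dim Tensor, while the vectorized form dispatches once over the whole tensor.

```cpp
#include <torch/torch.h>

// Element-wise, Kaldi style: one operator dispatch per element.
float sum_elementwise(const torch::Tensor& t) {
  float acc = 0.f;
  for (int64_t i = 0; i < t.size(0); ++i) {
    acc += t[i].item<float>();  // t[i] allocates a 0-dim Tensor each time
  }
  return acc;
}

// Vectorized: a single dispatch over the whole tensor.
float sum_vectorized(const torch::Tensor& t) {
  return t.sum().item<float>();
}
```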