How to contribute to ESPnet

If you are interested in contributing to ESPnet, your contributions will fall into three categories:

If you want to propose a new feature and implement it, please post about your intended feature at the issues, or you can contact Shinji Watanabe shinjiw@ieee.org or other main developers. We shall discuss the design and implementation. Once we agree that the plan looks good, go ahead and implement it. You can find ongoing major development plans at https://github.com/espnet/espnet/milestones
If you want to implement a minor feature or bug-fix for an issue, please first take a look at the existing issues (https://github.com/espnet/espnet/pulls) and/or pull requests (https://github.com/espnet/espnet/pulls). Pick an issue and comment on the task that you want to work on this feature. If you need more context on a particular issue, please ask us and then we shall provide more information.

We also welcome if you find some bugs during your actual use of ESPnet and make a PR to fix them.
ESPnet provides and maintains a lot of reproducible examples similar to Kaldi (called recipe). The recipe creation/update/bug-fix is one of our major development items, and we really encourage you to work on it. When you port a Kaldi recipe to ESPnet, see https://github.com/espnet/espnet/wiki/How-to-port-the-Kaldi-recipe-to-the-ESPnet-recipe%3F

We also encourage you to report your results with your detailed environmental info and upload the model for the reproducibility (e.g., see https://github.com/espnet/espnet/blob/master/egs/tedlium2/asr1/RESULTS.md).

To make a report for RESULTS.md
- execute show_result.sh at a recipe main directory (where run.sh is located), as follows. You'll get environmental information and the evaluation result of each experiments in a markdown format.
```
$ show_result.sh
```
- execute pack_model.sh at a recipe main directory as follows. You'll get model information in a markdown format
```
$ pack_model.sh --lm <language model> <tr_conf> <dec_conf> <cmvn> <e2e>
```
- pack_model.sh also produces a packed espnet model (model.tar.gz). If you upload this model to somewhere with a download link, please put the link information to RESULTS.md.
- please contact Shinji Watanabe shinjiw@ieee.org if you want a web storage to put your model files.

Once you finish implementing a feature or bugfix, please send a Pull Request to https://github.com/espnet/espnet

If you are not familiar with creating a Pull Request, here are some guides:

http://stackoverflow.com/questions/14680711/how-to-do-a-github-pull-request
https://help.github.com/articles/creating-a-pull-request/

Version policy and development branches

We basically develop in the master branch.

We will keep the first version digit 0 until we have some super major changes in the project organization level.
The second version digit will be updated when we have major updates including new functions and refactoring, and their related bug fix and recipe changes. This version update will be done roughly at every half year so far (but it depends on the development plan).
The third version digit will be updated when we fix serious bugs or accumulate some minor changes including recipe related changes periodically (every two months or so).

Unit testing

ESPnet's testing is located under test/. You can install additional packages for testing as follows:

$ cd <espnet_root>
$ pip install -e ".[test]"

Python

Then you can run the entire test suite using flake8, autopep8, black and pytest with coverage by

./ci/test_python.sh

To create new test file. write functions named like def test_yyy(...) in files like test_xxx.py under test/. Pytest will automatically test them. Note that, pytest-timeouts raises an error when any tests exceed 2.0 sec. To keep unittests fast, please avoid large parameters, dynamic imports, and file access. If your unittest really needs more time, you can annotate your test function with @pytest.mark.timeout(sec). You can find pytest fixtures in test/conftest.py. They finalize unit tests.

Bash scripts

You can also test the scripts in utils with bats-core and shellcheck.

To test:

./ci/test_bash.sh

Integration testing

Write new integration tests in ci/test_integration.sh when you add new features in espnet/bin. They use our smallest dataset egs/mini_an4 to test run.sh. To make the coverage take them into account, don't forget --python ${python} support in your run.sh

# ci/integration_test.sh

python="coverage run --append"

cd egs/mini_an4/your_task
./run.sh --python "${python}"

Configuration files

setup.cfg configures pytest, black and flake8.
.travis.yml configures Travis-CI (unittests, doc deploy).
.circleci/config.yml configures Circle-CI (unittests, integration tests).
.github/workflows configures Github Actions (unittests, integration tests).

Writing new tools

You can place your new tools under

espnet/bin: heavy and large (e.g., neural network related) core tools.
utils: lightweight self-contained python/bash scripts.

For utils scripts, do not forget to add help messages, and test scripts under test_utils.

Python tools guideline

To generate doc, do not forget def get_parser(): -> ArgumentParser in the main file.

#!/usr/bin/env python3
# Copyright XXX
#  Apache 2.0  (http://www.apache.org/licenses/LICENSE-2.0)
import argparse

# NOTE: do not forget this
def get_parser():
    parser = argparse.ArgumentParser(
        description="awsome tool",  # DO NOT forget this
    )
    ...
    return parser

if __name__ == '__main__':
    args = get_parser().parse_args()
    ...

Bash tools guideline

To generate doc, support --help to show its usage. If you use Kaldi's utils/parse_option.sh, define help_message="Usage: $0 ...".

Writing documentation

See doc.

Adding pretrained models

Pack your trained models using utils/pack_model.sh and upload it here (You require permission). Add the shared link to utils/recog_wav.sh or utils/synth_wav.sh as follows:

    "tedlium.demo") share_url="https://drive.google.com/open?id=1UqIY6WJMZ4sxNxSugUqp3mrGb3j6h7xe" ;;

The model name is arbitrary for now.

On CI failure

Travis CI and Github Actions

read log from PR checks > details

Circle CI

read log from PR checks > details
turn on Rerun workflow > Rerun job with SSH
open your local terminal and ssh -p xxx xxx (check circle ci log for the exact address)
try anything you can to pass the CI

Codecov

write more tests to increase coverage
explain to reviewers why you can't increase coverage

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

CONTRIBUTING.md

CONTRIBUTING.md

How to contribute to ESPnet

Version policy and development branches

Unit testing

Python

Bash scripts

Integration testing

Configuration files

Writing new tools

Python tools guideline

Bash tools guideline

Writing documentation

Adding pretrained models

On CI failure

Travis CI and Github Actions

Circle CI

Codecov

Files

CONTRIBUTING.md

Latest commit

History

CONTRIBUTING.md

File metadata and controls

How to contribute to ESPnet

Version policy and development branches

Unit testing

Python

Bash scripts

Integration testing

Configuration files

Writing new tools

Python tools guideline

Bash tools guideline

Writing documentation

Adding pretrained models

On CI failure

Travis CI and Github Actions

Circle CI

Codecov