Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Unable to decode using canary 1b model #10680

Open
uni-saurabh-vyas opened this issue Sep 30, 2024 · 1 comment
Open

Unable to decode using canary 1b model #10680

uni-saurabh-vyas opened this issue Sep 30, 2024 · 1 comment
Assignees
Labels
bug Something isn't working

Comments

@uni-saurabh-vyas
Copy link

Describe the bug

A clear and concise description of what the bug is.

I am using nemo inside docker environment, with single GPU, unable to get the ASR ouput using canary 1b model

Error executing job with overrides: ['pretrained_name=nvidia/canary-1b', 'audio_dir=/opt/workspace/es_test', 'output_filename=canary_output.txt', 'clean_groundtruth_text=True', 'langid=es', 'batch_size=1', 'compute_timestamps=False', 'compute_langs=False', 'cuda=0', 'amp=True', 'append_pred=False', 'pred_name_postfix=canary']
Traceback (most recent call last):
  File "/opt/asr_ensemble/repo/nemo/examples/asr/transcribe_speech.py", line 290, in main
    asr_model.change_decoding_strategy(cfg.ctc_decoding)
  File "/usr/local/lib/python3.10/dist-packages/nemo/collections/asr/models/aed_multitask_models.py", line 203, in change_decoding_strategy
    self.decoding = MultiTaskDecoding(
  File "/usr/local/lib/python3.10/dist-packages/nemo/collections/asr/parts/submodules/multitask_decoding.py", line 377, in __init__
    super().__init__(
  File "/usr/local/lib/python3.10/dist-packages/nemo/collections/asr/parts/submodules/multitask_decoding.py", line 124, in __init__
    raise NotImplementedError("Greedy decoding is not implemented yet.")
NotImplementedError: Greedy decoding is not implemented yet.

Steps/Code to reproduce bug

python3 nemo/examples/asr/transcribe_speech.py \
pretrained_name="nvidia/canary-1b" \
audio_dir="/opt/workspace/es_test" \
output_filename="canary_output.txt" \
clean_groundtruth_text=True \
langid='es' \
batch_size=1 \
compute_timestamps=False \
compute_langs=False \
cuda=0 \
amp=True \
append_pred=False \
pred_name_postfix="canary"

Expected behavior

A clear and concise description of what you expected to happen.

Environment overview (please complete the following information)

  • Environment location: [Bare-metal, Docker, Cloud(specify cloud provider - AWS, Azure, GCP, Collab)]
  • Method of NeMo install: [pip install or from source]. Please specify exact commands you used to install.
  • If method of install is [Docker], provide docker pull & docker run commands used

Environment details

If NVIDIA docker image is used you don't need to specify these.
Otherwise, please provide:

  • OS version Ubuntu 20
  • PyTorch version
  • Python version 3.10

Additional context

Add any other context about the problem here.
Example: GPU model

@uni-saurabh-vyas uni-saurabh-vyas added the bug Something isn't working label Sep 30, 2024
@pzelasko
Copy link
Collaborator

pzelasko commented Oct 8, 2024

Which NeMo version is this? Greedy decoding for Canary should work if you install the latest 2.0 pre-release, or the latest main branch.

@pzelasko pzelasko self-assigned this Oct 8, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
bug Something isn't working
Projects
None yet
Development

No branches or pull requests

2 participants