-
Notifications
You must be signed in to change notification settings - Fork 2.5k
Issues: NVIDIA/NeMo
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Author
Label
Projects
Milestones
Assignee
Sort
Issues list
Converting Mamba to tp4: RuntimeError: The size of tensor a (18560) must match the size of tensor b (4640) at non-singleton dimension 0
bug
Something isn't working
#10966
opened Oct 21, 2024 by
zixianwang2022
NeMO dependency issues on HuggingFace Hub (for ASR models)
bug
Something isn't working
#10940
opened Oct 18, 2024 by
bhavnicksm
NeMo2.0 nemorun llm export ValueError: PyTorch DDP is not enabled for mcore optimizer
bug
Something isn't working
#10939
opened Oct 18, 2024 by
lifeiteng
The SDXL Infer output image is full of noise
bug
Something isn't working
#10938
opened Oct 18, 2024 by
blacklong28
Modules fail for Dreambooth example
bug
Something isn't working
#10888
opened Oct 15, 2024 by
paulaserna16
Converting trained llama 2 checkpoint to hf gives "invalid key" error
bug
Something isn't working
#10884
opened Oct 14, 2024 by
jiaji-huang
SFT stage use context parallel with flash attention error
bug
Something isn't working
#10876
opened Oct 14, 2024 by
ARQlalala
Allow OOMtimizer tokenizer point towards just parent directory
#10870
opened Oct 13, 2024 by
tbartley94
[NeVa Pretraining] Vision Encoder Created on All GPUs During Pipeline Parallelism
#10805
opened Oct 8, 2024 by
Esthesia
When training ASR models, it saves .nemo 2 times in a row
bug
Something isn't working
#10798
opened Oct 8, 2024 by
AudranBert
Resuming from a checkpoint that ended before the epoch ended and your dataloader is not resumable
bug
Something isn't working
#10797
opened Oct 8, 2024 by
AudranBert
Unable to merge lora weights: "world_size (1) is not divisible by 4"
bug
Something isn't working
#10782
opened Oct 7, 2024 by
Elan456
IPython
should be included in the requirements
bug
#10772
opened Oct 5, 2024 by
MahmoudAshraf97
Loading 70B model from .nemo checkpoint takes very long time
#10745
opened Oct 3, 2024 by
jiaji-huang
Global shape mismatch for loaded ((1024, 768)) and expected ((512, 768)) tensor for key model.embedding.position_embeddings.weight
bug
Something isn't working
#10715
opened Oct 2, 2024 by
Alireza3242
Using MSDD model with a different speaker embedding model
#10681
opened Sep 30, 2024 by
MahmoudAshraf97
Unable to decode using canary 1b model
bug
Something isn't working
#10680
opened Sep 30, 2024 by
uni-saurabh-vyas
Previous Next
ProTip!
Type g i on any issue or pull request to go back to the issue listing page.