-
Notifications
You must be signed in to change notification settings - Fork 530
Issues: mosaicml/llm-foundry
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Author
Label
Projects
Milestones
Assignee
Sort
Issues list
loss.detach().clone().mean() * (microbatch_size / current_batch_size
bug
Something isn't working
#1596
opened Oct 17, 2024 by
YixinSong-e
Fine-tuning error in conda environment without docker image
bug
Something isn't working
#1538
opened Sep 21, 2024 by
LalchandPandia
When Finetuning Llama3, Error occurs
bug
Something isn't working
#1508
opened Sep 2, 2024 by
AndrewHYC
ERROR:composer.cli.launcher:Global rank 0 (PID 208865) exited with code -11
question
Further information is requested
#1501
opened Aug 30, 2024 by
AndrewHYC
Any example script to run multi-node training for slurm?
enhancement
New feature or request
#1378
opened Jul 20, 2024 by
wavy-jung
Allow multiprocessing when preparing ICL dataset
enhancement
New feature or request
#1276
opened Jun 13, 2024 by
sanjari-orb
could you give an elaborated steps about how to run llm-foundry on AMD mi250 devices
bug
Something isn't working
#1242
opened May 27, 2024 by
Alice1069
LLaMA PRO training resume problem
question
Further information is requested
#1231
opened May 23, 2024 by
germanjke
Conversion Sharded -> Monolithic checkpoint
question
Further information is requested
#1220
opened May 17, 2024 by
pretidav
Fine-tune dbrx-instruct on a single VM with 8 H100s
question
Further information is requested
#1105
opened Apr 10, 2024 by
hosseinsarshar
Composer crashes when attempting to load sharded checkpoint
bug
Something isn't working
#998
opened Feb 27, 2024 by
growlix
How to support multi-threaded parallel data preprocessing?
enhancement
New feature or request
#870
opened Jan 14, 2024 by
YixinSong-e
Any plan for supporting DPO?
enhancement
New feature or request
#846
opened Jan 8, 2024 by
lorabit110
Converted PrefixLM HF snapshot must enable cache for generation in config
bug
Something isn't working
#780
opened Dec 6, 2023 by
timsteuer
eval.py
hangs when config yaml's model hparams don't match model checkpoint hparams
bug
#755
opened Nov 21, 2023 by
growlix
Converting a composer seq2seq t5 model throws an exception
bug
Something isn't working
#754
opened Nov 21, 2023 by
timsteuer
Benchmarking GLUE tasks for in-context learning
question
Further information is requested
#707
opened Oct 31, 2023 by
ashim95
mosaicml-turbo: Where to find the repo?
question
Further information is requested
#565
opened Aug 29, 2023 by
agarvic
[Bug] Different batch_size return different evaluating result
bug
Something isn't working
#541
opened Aug 21, 2023 by
SingL3
Previous Next
ProTip!
Follow long discussions with comments:>50.