Skip to content

Actions: NVIDIA/NeMo

.github/workflows/node-reboot.yml

Actions

Loading...
Loading

Show workflow options

Create status badge

Loading
1,205 workflow runs
1,205 workflow runs

Filter by Event

Filter by Status

Filter by Branch

Filter by Actor

Custom FSDP path added to megatron parallel.
.github/workflows/node-reboot.yml #1207: Commit b8044db pushed by Victor49152
November 12, 2024 18:06 Failure mingyuanm/flux_controlnet
November 12, 2024 18:06 Failure
add 40b
.github/workflows/node-reboot.yml #1206: Commit 47ce70a pushed by JRD971000
November 12, 2024 15:42 Failure alit/hyena_ux
November 12, 2024 15:42 Failure
load ckpt with dist_checkpointing
.github/workflows/node-reboot.yml #1205: Commit 6aabe99 pushed by yashaswikarnati
November 12, 2024 02:30 Failure yash/mimo_ckpt_fix
November 12, 2024 02:30 Failure
add val check interval
.github/workflows/node-reboot.yml #1204: Commit 10c1124 pushed by JRD971000
November 11, 2024 17:33 Failure alit/hyena_ux
November 11, 2024 17:33 Failure
cleanup
.github/workflows/node-reboot.yml #1203: Commit 218d6e9 pushed by JRD971000
November 11, 2024 17:29 Failure alit/hyena_ux
November 11, 2024 17:29 Failure
add wandb logger
.github/workflows/node-reboot.yml #1202: Commit ede89e3 pushed by JRD971000
November 11, 2024 17:23 Failure alit/hyena_ux
November 11, 2024 17:23 Failure
add docstring
.github/workflows/node-reboot.yml #1201: Commit ea62e6a pushed by JRD971000
November 11, 2024 15:50 Failure alit/hyena_ux
November 11, 2024 15:50 Failure
Hyena wrapper: Weight decay override function (#11203)
.github/workflows/node-reboot.yml #1200: Commit fd58413 pushed by JRD971000
November 11, 2024 15:15 Failure alit/hyena_ux
November 11, 2024 15:15 Failure
mimo overfit to single example
.github/workflows/node-reboot.yml #1199: Commit 62e41b0 pushed by yashaswikarnati
November 8, 2024 20:38 Failure yash/mimo_ckpt_fix
November 8, 2024 20:38 Failure
saving
.github/workflows/node-reboot.yml #1198: Commit abb4fab pushed by lilithgrigoryan
November 7, 2024 15:02 Failure lgrigoryan/rnnt_batched_beam_search
November 7, 2024 15:02 Failure
Hyena wrapper: Weight decay override function
.github/workflows/node-reboot.yml #1197: Pull request #11203 by artbataev
November 7, 2024 08:41 Failure guyjacob:guyj/hyena_ux_wd_override
November 7, 2024 08:41 Failure
training with captioning data
.github/workflows/node-reboot.yml #1196: Commit 6793f63 pushed by yashaswikarnati
November 6, 2024 01:55 Failure yash/mimo_ckpt_fix
November 6, 2024 01:55 Failure
Flux and controlnet now could train on precached mode.
.github/workflows/node-reboot.yml #1195: Commit 3b62be0 pushed by Victor49152
November 5, 2024 23:22 Failure mingyuanm/flux_controlnet
November 5, 2024 23:22 Failure
minor fixes
.github/workflows/node-reboot.yml #1194: Commit 935dd2c pushed by JRD971000
November 5, 2024 16:07 Failure alit/hyena_ux
November 5, 2024 16:07 Failure
Update for compatibility with MCore Hyena config changes (#11162)
.github/workflows/node-reboot.yml #1193: Commit 14cc31a pushed by JRD971000
November 5, 2024 15:20 Failure alit/hyena_ux
November 5, 2024 15:20 Failure
Update for compatibility with MCore Hyena config changes
.github/workflows/node-reboot.yml #1192: Pull request #11162 by guyjacob
November 5, 2024 12:36 Failure guyjacob:guyj/hyena_ux_config
November 5, 2024 12:36 Failure
Update for compatibility with MCore Hyena config changes
.github/workflows/node-reboot.yml #1191: Pull request #11162 by artbataev
November 5, 2024 12:30 Failure guyjacob:guyj/hyena_ux_config
November 5, 2024 12:30 Failure
Update for compatibility with MCore Hyena config changes
.github/workflows/node-reboot.yml #1190: Pull request #11162 by artbataev
November 5, 2024 12:29 Failure guyjacob:guyj/hyena_ux_config
November 5, 2024 12:29 Failure
added captioning datamodule
.github/workflows/node-reboot.yml #1189: Commit bd72d97 pushed by yashaswikarnati
November 5, 2024 02:23 Failure yash/mimo_ckpt_fix
November 5, 2024 02:23 Failure
Flux training with DDP tested on 1 GPU
.github/workflows/node-reboot.yml #1188: Commit e0de704 pushed by Victor49152
November 4, 2024 23:35 Failure mingyuanm/flux_controlnet
November 4, 2024 23:35 Failure
fix activation granularity for the test model
.github/workflows/node-reboot.yml #1187: Commit c046802 pushed by JRD971000
November 4, 2024 23:34 Failure alit/hyena_ux
November 4, 2024 23:34 Failure
CP init (#11154)
.github/workflows/node-reboot.yml #1186: Commit f959da4 pushed by JRD971000
November 4, 2024 23:15 Failure alit/hyena_ux
November 4, 2024 23:15 Failure
merge cp changes with hyena_ux
.github/workflows/node-reboot.yml #1185: Commit 7964e07 pushed by JRD971000
November 4, 2024 23:15 Failure alit/hyena_ux_cp
November 4, 2024 23:15 Failure
Apply isort and black reformatting
.github/workflows/node-reboot.yml #1184: Commit 92cc949 pushed by artbataev
November 4, 2024 23:01 Failure alit/hyena_ux_cp
November 4, 2024 23:01 Failure
CP init
.github/workflows/node-reboot.yml #1183: Commit 42f4ea9 pushed by JRD971000
November 4, 2024 23:00 Failure alit/hyena_ux_cp
November 4, 2024 23:00 Failure