Actions: EleutherAI/gpt-neox

Pull Request

363 workflow runs

add rwkv support
Pull Request #899: Pull request #1198 synchronize by Quentin-Anthony
April 19, 2024 01:34 8m 1s rwkv
add rwkv support
Pull Request #898: Pull request #1198 synchronize by Quentin-Anthony
April 19, 2024 01:30 7m 51s rwkv
add rwkv support
Pull Request #897: Pull request #1198 synchronize by Quentin-Anthony
April 19, 2024 01:26 9m 45s rwkv
Adding replay into GPT-NeoX
Pull Request #896: Pull request #1200 synchronize by bentherien
April 14, 2024 20:40 4m 27s adding_replay
Adding replay into GPT-NeoX
Pull Request #895: Pull request #1200 synchronize by bentherien
April 14, 2024 20:36 4m 25s adding_replay
Adding replay into GPT-NeoX
Pull Request #894: Pull request #1200 synchronize by bentherien
April 14, 2024 20:24 9m 54s adding_replay
Adding replay into GPT-NeoX
Pull Request #893: Pull request #1200 opened by AIproj
April 13, 2024 00:08 2m 47s adding_replay
Megablocks-based MoE
Pull Request #890: Pull request #1197 synchronize by StellaAthena
March 28, 2024 16:52 1m 1s DayOfThePenguin:megablocks_pr
Megablocks-based MoE
Pull Request #889: Pull request #1197 synchronize by StellaAthena
March 28, 2024 16:52 1m 16s DayOfThePenguin:megablocks_pr
Megablocks-based MoE
Pull Request #888: Pull request #1197 synchronize by StellaAthena
March 28, 2024 16:52 1m 20s DayOfThePenguin:megablocks_pr
Megablocks-based MoE
Pull Request #887: Pull request #1197 synchronize by StellaAthena
March 28, 2024 16:51 1m 54s DayOfThePenguin:megablocks_pr
Added infinite lr schedules
Pull Request #883: Pull request #1194 opened by kshitijkg
March 25, 2024 15:14 8m 56s infinite_lr
[ZeRO-3] Ensured passing neox deepspeed_config when using partitioned init
Pull Request #881: Pull request #1191 synchronize by R0n12
March 20, 2024 04:03 1m 49s R0n12:lang/z3-init
[AMD] Supporting fused kernels build using JIT
Pull Request #880: Pull request #1188 synchronize by R0n12
March 20, 2024 00:09 2m 11s R0n12:lang/neox-amd
[ZeRO-3] Partitioned init with deepspeed.zero.Init()
Pull Request #879: Pull request #1190 synchronize by Quentin-Anthony
March 18, 2024 18:04 1m 38s R0n12:lang/z3-init
[ZeRO-3] Partitioned init with deepspeed.zero.Init()
Pull Request #878: Pull request #1190 synchronize by Quentin-Anthony
March 18, 2024 18:03 1m 56s R0n12:lang/z3-init
[ZeRO-3] Partitioned init with deepspeed.zero.Init()
Pull Request #877: Pull request #1190 opened by R0n12
March 18, 2024 09:37 1m 22s R0n12:lang/z3-init
[AMD] Supporting fused kernels build using JIT
Pull Request #876: Pull request #1188 synchronize by R0n12
March 18, 2024 06:21 1m 57s R0n12:lang/neox-amd
Mamba + Tensor Parallel Support
Pull Request #872: Pull request #1184 synchronize by Quentin-Anthony
March 15, 2024 14:42 7m 17s tp-mamba-neox
Mamba + Tensor Parallel Support
Pull Request #869: Pull request #1184 opened by haileyschoelkopf
March 12, 2024 15:36 9m 39s tp-mamba-neox
Switch to using Cuda Flash Attn for Alibi
Pull Request #868: Pull request #1183 opened by haileyschoelkopf
March 10, 2024 19:26 1d 0h 22m 25s flash-alibi-cuda