Pull requests: NVIDIA/TransformerEngine

Pull requests list

Fix seq_dim in CP implementation
#1264 opened Oct 16, 2024 by xrennvidia
4 of 13 tasks
[JAX] Skip V100 encoder tests
#1262 opened Oct 16, 2024 by zlsh80826
8 of 13 tasks
Add THD + GQA supports
#1260 opened Oct 16, 2024 by zlsh80826
8 of 13 tasks
[PyTorch] Reorganize L1 tests (label: testing)
#1255 opened Oct 15, 2024 by timmoon10
5 of 14 tasks
Draft: reduce cudagraph mem via preallocations
#1253 opened Oct 15, 2024 by JimmyZhang12
13 tasks
Fix layernorm fsdp
#1250 opened Oct 14, 2024 by eljandoubi
2 tasks done
fused out correction in CP
#1248 opened Oct 14, 2024 by xiaoyao0115
12 tasks
[Bugfix] Fix bias for 0-dim tensors in gemm
#1246 opened Oct 12, 2024 by yaox12
1 of 13 tasks
[C] Add max_t support for THD
#1244 opened Oct 11, 2024 by cyanguwa Draft
8 of 13 tasks
Save CUDA Graph memory by reusing input and output tensors
#1234 opened Oct 9, 2024 by buptzyb
5 of 13 tasks
Support CUDA Graph for MoE models
#1233 opened Oct 9, 2024 by buptzyb
6 of 13 tasks
[Pytorch] Check gradient in test numerics
#1229 opened Oct 8, 2024 by pggPL
7 of 13 tasks
[TE/JAX] Enabling CudaGraph for custom calls with FFI jax
#1228 opened Oct 7, 2024 by phu0ngng
4 of 13 tasks
[PyTorch] Improve CP P2P efficiency
#1208 opened Sep 26, 2024 by yenchenlin
1 of 6 tasks
Draft: Use fused push_send_recv kernel for TP AG and RS overlaps
#1200 opened Sep 24, 2024 by erhoo82
13 tasks
[PyTorch] Fused dbias-cast-transpose in bias operation
#1168 opened Sep 6, 2024 by timmoon10
7 of 13 tasks
Fix autocast deprecation warning.
#1167 opened Sep 6, 2024 by jondeaton
[PyTorch] Activation operations
#1164 opened Sep 6, 2024 by timmoon10
6 of 13 tasks
[PyTorch] Avoid saving fp8_tensors in certain scenarios
#1143 opened Aug 28, 2024 by cyanguwa
8 of 13 tasks