Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Merge with upstream #48

Open
wants to merge 2,585 commits into
base: main
Choose a base branch
from
Open

Merge with upstream #48

wants to merge 2,585 commits into from

Conversation

Quentin-Anthony
Copy link
Collaborator

No description provided.

@Quentin-Anthony Quentin-Anthony self-assigned this Jan 15, 2024
yaox12 and others added 29 commits October 29, 2024 22:10
Fix async_grad_allreduce deprecation warning

See merge request ADLR/megatron-lm!2247
openai completions  endpoint

See merge request ADLR/megatron-lm!2212
Add unit tests for Mamba hybrid model sub-units

See merge request ADLR/megatron-lm!2233
tests: Fix backoff

See merge request ADLR/megatron-lm!2287
revert: Try/catch

See merge request ADLR/megatron-lm!2288
More multimodal evals

See merge request ADLR/megatron-lm!2174
Co-authored-by: Dingqing Yang <[email protected]>
Co-authored-by: Dingqing Yang <[email protected]>
Co-authored-by: Dingqing Yang <[email protected]>
Co-authored-by: Dingqing Yang <[email protected]>
Co-authored-by: Dingqing Yang <[email protected]>
Co-authored-by: Dingqing Yang <[email protected]>
Co-authored-by: Sangkug Lym <[email protected]>
tunable schedule with overlapping

See merge request ADLR/megatron-lm!2117
[Test] Fix Config for RoPE Fusion

See merge request ADLR/megatron-lm!2298
Add dist-ckpt support to encoder_pipeline_parallel

See merge request ADLR/megatron-lm!2210
Add TestTransformerLayerInterface test

See merge request ADLR/megatron-lm!2297
ci: Fix nightly tests

See merge request ADLR/megatron-lm!2300
tests: Disable flaky test

See merge request ADLR/megatron-lm!2302
tests: Disable modelopt test on dev

See merge request ADLR/megatron-lm!2303
Remove `is_onnx_export_mode` import from TE

See merge request ADLR/megatron-lm!2296
Shunkangz and others added 30 commits December 7, 2024 19:53
…requency Patterns and Configurable MoE FFN Hidden Size

Co-authored-by: Zijie Yan <[email protected]>
Co-authored-by: xuwenc <[email protected]>
Enhance MoE Architecture: Support MoE Layer Frequency Patterns and Configurable MoE FFN Hidden Size

Closes #225

See merge request ADLR/megatron-lm!2230
Resolve "Attention as a config option in mcore"

Closes #326

See merge request ADLR/megatron-lm!2168
…memory allocation, no unnecessary casting/copying

Co-authored-by: Mcore Bot <[email protected]>
sample index helper function, no unnecessary memory allocation, no unnecessary casting/copying

See merge request ADLR/megatron-lm!2381
Fix peak memory consumption for NeMo

See merge request ADLR/megatron-lm!2388
[dist ckpt] Use gather object instead of all gather object when running consistency check

See merge request ADLR/megatron-lm!2413
Co-authored-by: Cyril Meurillon <[email protected]>
Co-authored-by: Deepak Narayanan <[email protected]>
Co-authored-by: Cyril Meurillon <[email protected]>
Add functionality to re-run iterations

See merge request ADLR/megatron-lm!2282
Bugfix in multimodal dataloader_provider

See merge request ADLR/megatron-lm!2418
Refactor MoE specs: move all submodules of MoELayer into the spec

Closes #314

See merge request ADLR/megatron-lm!2101
Remove all-gather before first iteration to not spread corrupted values

See merge request ADLR/megatron-lm!2414
move get_batch_on_this_cp_rank to mcore utils

See merge request ADLR/megatron-lm!2404
Small VLM example

See merge request ADLR/megatron-lm!2432
Fix assert warning in !2282

See merge request ADLR/megatron-lm!2443
Fix wrapping of external dataloaders

See merge request ADLR/megatron-lm!2453
Fix moe dist-ckpt compatibility for !2230

See merge request ADLR/megatron-lm!2449
Llava pp > 1 fix

See merge request ADLR/megatron-lm!2441
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.