Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[v3] Training refactor - MultiGPU, loss logging, bf16, etc. #2449

Merged
merged 90 commits into from
Apr 25, 2024

Commits on Oct 26, 2022

  1. Configuration menu
    Copy the full SHA
    d4aea88 View commit details
    Browse the repository at this point in the history
  2. Fix type of tokenizer

    matthewfranglen committed Oct 26, 2022
    Configuration menu
    Copy the full SHA
    602c0a4 View commit details
    Browse the repository at this point in the history
  3. Configuration menu
    Copy the full SHA
    9d984c9 View commit details
    Browse the repository at this point in the history
  4. Configuration menu
    Copy the full SHA
    c5f3dde View commit details
    Browse the repository at this point in the history

Commits on Jan 23, 2024

  1. Configuration menu
    Copy the full SHA
    0b179cf View commit details
    Browse the repository at this point in the history

Commits on Jan 25, 2024

  1. Configuration menu
    Copy the full SHA
    3ebcc27 View commit details
    Browse the repository at this point in the history
  2. Configuration menu
    Copy the full SHA
    09fce93 View commit details
    Browse the repository at this point in the history
  3. Resolve test issues

    tomaarsen committed Jan 25, 2024
    Configuration menu
    Copy the full SHA
    cf7d1c7 View commit details
    Browse the repository at this point in the history
  4. Reformat

    tomaarsen committed Jan 25, 2024
    Configuration menu
    Copy the full SHA
    14fac0a View commit details
    Browse the repository at this point in the history
  5. Update the imports

    tomaarsen committed Jan 25, 2024
    Configuration menu
    Copy the full SHA
    8a8912b View commit details
    Browse the repository at this point in the history
  6. Configuration menu
    Copy the full SHA
    dea5c94 View commit details
    Browse the repository at this point in the history
  7. Remove dead code

    tomaarsen committed Jan 25, 2024
    Configuration menu
    Copy the full SHA
    ff8ef4d View commit details
    Browse the repository at this point in the history
  8. Configuration menu
    Copy the full SHA
    636ba4d View commit details
    Browse the repository at this point in the history
  9. Configuration menu
    Copy the full SHA
    7840bac View commit details
    Browse the repository at this point in the history
  10. Configuration menu
    Copy the full SHA
    c0461e9 View commit details
    Browse the repository at this point in the history

Commits on Jan 26, 2024

  1. Avoid use_amp on CPU tests

    tomaarsen committed Jan 26, 2024
    Configuration menu
    Copy the full SHA
    eb3ccfe View commit details
    Browse the repository at this point in the history
  2. Configuration menu
    Copy the full SHA
    aaf72f3 View commit details
    Browse the repository at this point in the history
  3. Avoid circular import

    tomaarsen committed Jan 26, 2024
    Configuration menu
    Copy the full SHA
    9a3912b View commit details
    Browse the repository at this point in the history
  4. Configuration menu
    Copy the full SHA
    036bc40 View commit details
    Browse the repository at this point in the history

Commits on Feb 5, 2024

  1. Configuration menu
    Copy the full SHA
    37ab364 View commit details
    Browse the repository at this point in the history

Commits on Feb 6, 2024

  1. Use tokenize function instead of tokenizer;

    Add EvaluatorCallback which calls the evaluator on every epoch (for BC);
    Stop saving "do_lower_case" from Transformer;
    tomaarsen committed Feb 6, 2024
    Configuration menu
    Copy the full SHA
    de51acd View commit details
    Browse the repository at this point in the history
  2. Configuration menu
    Copy the full SHA
    0b3811e View commit details
    Browse the repository at this point in the history
  3. Reformat

    tomaarsen committed Feb 6, 2024
    Configuration menu
    Copy the full SHA
    645bea8 View commit details
    Browse the repository at this point in the history
  4. Configuration menu
    Copy the full SHA
    9cfeb0c View commit details
    Browse the repository at this point in the history
  5. Add support for the tokenizer to return more than just input_ids & at…

    …tention_masks
    
    Required for LSTM
    tomaarsen committed Feb 6, 2024
    Configuration menu
    Copy the full SHA
    69800a3 View commit details
    Browse the repository at this point in the history
  6. Configuration menu
    Copy the full SHA
    b0dce5d View commit details
    Browse the repository at this point in the history
  7. Configuration menu
    Copy the full SHA
    78efaec View commit details
    Browse the repository at this point in the history
  8. Configuration menu
    Copy the full SHA
    0ec8ef4 View commit details
    Browse the repository at this point in the history
  9. Configuration menu
    Copy the full SHA
    5e69caf View commit details
    Browse the repository at this point in the history
  10. Pass epoch to evaluator in every-epoch callback

    For fit backwards compatibility
    tomaarsen committed Feb 6, 2024
    Configuration menu
    Copy the full SHA
    8a7c56b View commit details
    Browse the repository at this point in the history
  11. Run formatting

    tomaarsen committed Feb 6, 2024
    Configuration menu
    Copy the full SHA
    016983a View commit details
    Browse the repository at this point in the history

Commits on Feb 7, 2024

  1. Configuration menu
    Copy the full SHA
    d2d1f1b View commit details
    Browse the repository at this point in the history
  2. Configuration menu
    Copy the full SHA
    66d15a1 View commit details
    Browse the repository at this point in the history
  3. Remove dead code

    tomaarsen committed Feb 7, 2024
    Configuration menu
    Copy the full SHA
    170843c View commit details
    Browse the repository at this point in the history
  4. Configuration menu
    Copy the full SHA
    af60a59 View commit details
    Browse the repository at this point in the history
  5. Reformatting

    tomaarsen committed Feb 7, 2024
    Configuration menu
    Copy the full SHA
    620003b View commit details
    Browse the repository at this point in the history
  6. Configuration menu
    Copy the full SHA
    7aed046 View commit details
    Browse the repository at this point in the history
  7. Configuration menu
    Copy the full SHA
    3f02af2 View commit details
    Browse the repository at this point in the history
  8. Configuration menu
    Copy the full SHA
    d79bffa View commit details
    Browse the repository at this point in the history
  9. Configuration menu
    Copy the full SHA
    984b0d6 View commit details
    Browse the repository at this point in the history

Commits on Feb 13, 2024

  1. Configuration menu
    Copy the full SHA
    581b023 View commit details
    Browse the repository at this point in the history
  2. Configuration menu
    Copy the full SHA
    2eb0cb3 View commit details
    Browse the repository at this point in the history
  3. Reformat

    tomaarsen committed Feb 13, 2024
    Configuration menu
    Copy the full SHA
    96063f9 View commit details
    Browse the repository at this point in the history

Commits on Feb 15, 2024

  1. Configuration menu
    Copy the full SHA
    c736f44 View commit details
    Browse the repository at this point in the history

Commits on Mar 5, 2024

  1. Configuration menu
    Copy the full SHA
    424f807 View commit details
    Browse the repository at this point in the history
  2. Configuration menu
    Copy the full SHA
    5abff20 View commit details
    Browse the repository at this point in the history

Commits on Mar 8, 2024

  1. Allow for fsdp=["full_shard", "auto_wrap"]

    with fsdp_config={"transformer_layer_cls_to_wrap": "BertLayer"}
    tomaarsen committed Mar 8, 2024
    Configuration menu
    Copy the full SHA
    37d6007 View commit details
    Browse the repository at this point in the history
  2. Configuration menu
    Copy the full SHA
    732b7a2 View commit details
    Browse the repository at this point in the history

Commits on Mar 12, 2024

  1. Configuration menu
    Copy the full SHA
    87fac73 View commit details
    Browse the repository at this point in the history
  2. Configuration menu
    Copy the full SHA
    4982cb7 View commit details
    Browse the repository at this point in the history

Commits on Mar 13, 2024

  1. When training with multiple datasets, add "dataset_name" column

    Rather than relying on some Batch Sampler hacking (which fails with some distributed training approaches)
    tomaarsen committed Mar 13, 2024
    Configuration menu
    Copy the full SHA
    bfa29a8 View commit details
    Browse the repository at this point in the history
  2. Update type hints: make loss & evaluator optional

    Co-authored-by: Wang Bo <[email protected]>
    tomaarsen and bwanglzu authored Mar 13, 2024
    Configuration menu
    Copy the full SHA
    7351908 View commit details
    Browse the repository at this point in the history
  3. Configuration menu
    Copy the full SHA
    d1fa120 View commit details
    Browse the repository at this point in the history
  4. Configuration menu
    Copy the full SHA
    4f97f21 View commit details
    Browse the repository at this point in the history
  5. Configuration menu
    Copy the full SHA
    deeb1a8 View commit details
    Browse the repository at this point in the history
  6. Configuration menu
    Copy the full SHA
    a0071a5 View commit details
    Browse the repository at this point in the history

Commits on Mar 25, 2024

  1. Configuration menu
    Copy the full SHA
    bf940bc View commit details
    Browse the repository at this point in the history

Commits on Apr 11, 2024

  1. Configuration menu
    Copy the full SHA
    f937325 View commit details
    Browse the repository at this point in the history
  2. Configuration menu
    Copy the full SHA
    59eed8c View commit details
    Browse the repository at this point in the history
  3. Remove some dead code

    tomaarsen committed Apr 11, 2024
    Configuration menu
    Copy the full SHA
    078ac76 View commit details
    Browse the repository at this point in the history
  4. Fix evaluator type hints

    tomaarsen committed Apr 11, 2024
    Configuration menu
    Copy the full SHA
    cd924ca View commit details
    Browse the repository at this point in the history
  5. Configuration menu
    Copy the full SHA
    518a71f View commit details
    Browse the repository at this point in the history
  6. Rephrase comments slightly

    tomaarsen committed Apr 11, 2024
    Configuration menu
    Copy the full SHA
    19eaadb View commit details
    Browse the repository at this point in the history

Commits on Apr 16, 2024

  1. Configuration menu
    Copy the full SHA
    459ca98 View commit details
    Browse the repository at this point in the history

Commits on Apr 17, 2024

  1. Configuration menu
    Copy the full SHA
    6f7816b View commit details
    Browse the repository at this point in the history
  2. Configuration menu
    Copy the full SHA
    21ae280 View commit details
    Browse the repository at this point in the history
  3. Configuration menu
    Copy the full SHA
    9062671 View commit details
    Browse the repository at this point in the history
  4. Configuration menu
    Copy the full SHA
    650c4eb View commit details
    Browse the repository at this point in the history
  5. Configuration menu
    Copy the full SHA
    778480f View commit details
    Browse the repository at this point in the history
  6. Configuration menu
    Copy the full SHA
    cd660e8 View commit details
    Browse the repository at this point in the history
  7. Improve docstring slightly

    tomaarsen committed Apr 17, 2024
    Configuration menu
    Copy the full SHA
    17d99fe View commit details
    Browse the repository at this point in the history
  8. Configuration menu
    Copy the full SHA
    8804443 View commit details
    Browse the repository at this point in the history
  9. Configuration menu
    Copy the full SHA
    d1b0b9d View commit details
    Browse the repository at this point in the history

Commits on Apr 18, 2024

  1. Remove moot TYPE_CHECKING

    tomaarsen committed Apr 18, 2024
    Configuration menu
    Copy the full SHA
    600bc6f View commit details
    Browse the repository at this point in the history
  2. Configuration menu
    Copy the full SHA
    bcff5d3 View commit details
    Browse the repository at this point in the history
  3. Configuration menu
    Copy the full SHA
    1c4c720 View commit details
    Browse the repository at this point in the history
  4. Configuration menu
    Copy the full SHA
    959c19b View commit details
    Browse the repository at this point in the history
  5. Configuration menu
    Copy the full SHA
    56321ae View commit details
    Browse the repository at this point in the history
  6. Configuration menu
    Copy the full SHA
    3de8910 View commit details
    Browse the repository at this point in the history
  7. Configuration menu
    Copy the full SHA
    e7717b6 View commit details
    Browse the repository at this point in the history
  8. Configuration menu
    Copy the full SHA
    92ac930 View commit details
    Browse the repository at this point in the history
  9. Configuration menu
    Copy the full SHA
    a7688a1 View commit details
    Browse the repository at this point in the history

Commits on Apr 19, 2024

  1. Remove unused arguments

    tomaarsen committed Apr 19, 2024
    Configuration menu
    Copy the full SHA
    22f9edf View commit details
    Browse the repository at this point in the history
  2. Configuration menu
    Copy the full SHA
    406d0d0 View commit details
    Browse the repository at this point in the history
  3. Configuration menu
    Copy the full SHA
    1f233e9 View commit details
    Browse the repository at this point in the history
  4. Configuration menu
    Copy the full SHA
    7eb077f View commit details
    Browse the repository at this point in the history
  5. Configuration menu
    Copy the full SHA
    db025eb View commit details
    Browse the repository at this point in the history
  6. Configuration menu
    Copy the full SHA
    5a1e2f5 View commit details
    Browse the repository at this point in the history

Commits on Apr 23, 2024

  1. Configuration menu
    Copy the full SHA
    7b42232 View commit details
    Browse the repository at this point in the history

Commits on Apr 24, 2024

  1. Configuration menu
    Copy the full SHA
    56922e3 View commit details
    Browse the repository at this point in the history