Adding support for 8-bit training with bitsandbytes #582

mrcabbage972 · 2023-01-10T03:08:34Z

Adding bitsandbytes dependency to requirements.txt.

Using the currently unused quantization option in the config file to set whether to use the BNB 8-bit optimizer.

Details on using the 8-bit optimizer in HF here and detailed discussion on HF implementation here.

Note the part about forcing the embedding layer to use the 32-bit optimizer. Doing that as suggested in the link above:

For existing pre-trained transformers models one could use them as is and use 8-bit optimizers for all weights, but 32-bit optimizers for the embedding layer.

This override code might require a bit more work if we choose to use a model that has a non-standard embedding layer.

sanagno · 2023-01-10T10:48:57Z

Thanks a lot, looks great! Can you also run the pre-commit for the final commit?

mrcabbage972 · 2023-01-11T02:14:54Z

@sanagno Yes, fixed.

mrcabbage972 added 6 commits January 8, 2023 18:55

Expanding survey of relevant research

3bf0e3a

Fixing EoL

cc4c008

Merge branch 'LAION-AI:main' into main

7167f69

Adding BNB 8-bit Adam

08bdadf

Merge remote-tracking branch 'origin/main'

adf631e

Merge branch 'LAION-AI:main' into main

2d4a47d

mrcabbage972 requested review from theblackcat102 and sanagno as code owners January 10, 2023 03:08

Adding override of 32-bit optimization for embedding layer

67aeed2

mrcabbage972 mentioned this pull request Jan 10, 2023

Supervised finetuning minor changes #456

Merged

yk added the ml label Jan 10, 2023

Fixing requirements file

d95c741

sanagno approved these changes Jan 11, 2023

View reviewed changes

sanagno merged commit 1c5d44c into LAION-AI:main Jan 11, 2023

sanagno pushed a commit that referenced this pull request Jan 11, 2023

quantization from #582

6438fdb

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Adding support for 8-bit training with bitsandbytes #582

Adding support for 8-bit training with bitsandbytes #582

mrcabbage972 commented Jan 10, 2023 •

edited

Loading

sanagno commented Jan 10, 2023

mrcabbage972 commented Jan 11, 2023

Adding support for 8-bit training with bitsandbytes #582

Adding support for 8-bit training with bitsandbytes #582

Conversation

mrcabbage972 commented Jan 10, 2023 • edited Loading

sanagno commented Jan 10, 2023

mrcabbage972 commented Jan 11, 2023

mrcabbage972 commented Jan 10, 2023 •

edited

Loading