[docs] Lora-like guides #1371
Conversation
The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.
Thank you @stevhliu for consolidating all the LoRA-like guides into a single guide focused on the usage of the different LoRA variants. Thank you for the HF Space collection of notebook examples.
Left a few suggestions and comments.
Thank you! 🤗
Thanks @stevhliu !
Thanks for writing this excellent guide and consolidating the different methods. Overall, this looks really good, I just have a concern regarding the use of AdaLora. Please check the comment and the solutions I suggested.
</hfoption>
<hfoption id="AdaLoRA">

[AdaLoRA](../conceptual_guides/adapter#adaptive-low-rank-adaptation-adalora) efficiently manages the LoRA parameter budget by assigning important weight matrices more parameters and pruning less important ones. In contrast, LoRA evenly distributes parameters across all modules. You can control the average desired *rank* or `r` of the matrices, and which modules to apply AdaLoRA to with `target_modules`. Other important parameters to set are `lora_alpha` (scaling factor), and `modules_to_save` (the modules apart from the AdaLoRA layers to be trained and saved). All of these parameters - and more - are found in the [`AdaLoraConfig`].
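As a concrete illustration, a minimal setup could look like the sketch below; the checkpoint, rank/alpha values, and target module names are illustrative assumptions rather than the guide's actual settings:

```python
from transformers import AutoModelForImageClassification
from peft import AdaLoraConfig, get_peft_model

# placeholder checkpoint; the guide trains an image classification model
model = AutoModelForImageClassification.from_pretrained("google/vit-base-patch16-224-in21k")

config = AdaLoraConfig(
    init_r=12,                          # initial rank of each matrix before pruning
    target_r=8,                         # average rank to converge to after budget allocation
    lora_alpha=32,                      # scaling factor
    target_modules=["query", "value"],  # modules to adapt (names depend on the architecture)
    modules_to_save=["classifier"],     # modules trained and saved in addition to the adapter layers
    total_step=1000,                    # total training steps, used by AdaLoRA's rank schedule
)

model = get_peft_model(model, config)
model.print_trainable_parameters()
```

`AdaLoraConfig` also exposes the budget schedule parameters (`tinit`, `tfinal`, `deltaT`) that control when rank pruning starts and stops and how often it runs.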
There is a bit of an issue with the AdaLora example. For AdaLora, we need to call the method `update_and_allocate` during each training step, otherwise the adaptation step is not performed. This requires either writing a custom training loop or subclassing the `Trainer` to add this step. An example of a custom training loop that calls this method is here:

`model.base_model.update_and_allocate(global_step)`

Given this guide, as is, the reader may not be aware that this is required and thus use AdaLora incorrectly. These solutions come to mind:

- Don't provide the AdaLora example. Instead, refer to the existing examples with custom training loops.
- Don't use `Trainer`; instead use a custom training loop that calls this method when AdaLora is being used (roughly as sketched after this list).
- Subclass `Trainer` to perform this step if AdaLora is being used. I don't know enough about `Trainer` to be really sure how to do that correctly.
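To make the custom-loop option concrete, here is a rough sketch; the model, dataloader, optimizer, and step counts are placeholders rather than code from the PR or the linked example:

```python
import torch

# `model` is assumed to be a PEFT model created with AdaLoraConfig;
# `train_dataloader` is assumed to yield batches of keyword arguments.
optimizer = torch.optim.AdamW(model.parameters(), lr=1e-3)

num_epochs = 3  # placeholder
global_step = 0
for epoch in range(num_epochs):
    for batch in train_dataloader:
        outputs = model(**batch)
        loss = outputs.loss
        loss.backward()
        optimizer.step()
        # AdaLoRA's allocation step: prune/redistribute the rank budget,
        # called after the optimizer step and before zeroing the gradients
        model.base_model.update_and_allocate(global_step)
        optimizer.zero_grad()
        global_step += 1
```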
Great point! I mention `update_and_allocate` + link to a custom training loop in a `Warning` in the training section for more visibility. I think we should still keep the example showing the `AdaLoraConfig` though, so users still know how to set it up.
Happy with the added warning, thanks for adding it.
* loras
* review
* fix
* feedback
* feedback
As discussed in #1332, this PR condenses the LoRA guides by showcasing one specific task type (image classification) and redirecting to other use cases/tasks stored on the PEFT Hub org. Instead, this guide expands on the other LoRA methods such as LoHa, LoKr, and AdaLoRA. The configs of these methods aren't optimized for the best performance, so any feedback you have on that would be super useful!