
[ModulesToSave] add correct hook management for modules to save #755

Merged (4 commits) on Jul 27, 2023

Conversation

@younesbelkada (Contributor) commented on Jul 26, 2023

What does this PR do?

Fixes #602; the solution was found by @BenjaminBossan.
When the base model is loaded with accelerate, the old hooks remained attached to the new module, causing ModulesToSaveWrapper to call the previous forward method, so gradients were not backpropagated to the correct module.

A reproduction script shared by @BenjaminBossan:

import torch
from peft import LoraConfig, TaskType, get_peft_model, prepare_model_for_kbit_training
from transformers import AutoTokenizer, AutoModelForSequenceClassification, set_seed

set_seed(123)

PRETRAIN = 'bigscience/bloomz-560m'
tokenizer = AutoTokenizer.from_pretrained(PRETRAIN)

# Toggle these to reproduce the different configurations discussed in #602.
load_in_4bit = True
device_map = None
should_resize_token_embeddings = False
should_prepare_model_for_kbit_training = True

model = AutoModelForSequenceClassification.from_pretrained(
    PRETRAIN,
    load_in_4bit=load_in_4bit,
    torch_dtype=torch.float32,
    device_map=device_map,
)
if should_prepare_model_for_kbit_training:
    model = prepare_model_for_kbit_training(model)

config = LoraConfig(
    r=16,
    lora_alpha=16,
    lora_dropout=0.05,
    bias="none",
    task_type=TaskType.SEQ_CLS,
)

peft_model = get_peft_model(model, config)
if should_resize_token_embeddings:
    model.resize_token_embeddings(len(tokenizer))

# For SEQ_CLS, the classification head `score` is wrapped in ModulesToSaveWrapper:
# `original_module` is the frozen original head, `modules_to_save.default` its trainable copy.
lm_head = peft_model.base_model.model.score
original_module = lm_head.original_module
modules_to_save = lm_head.modules_to_save.default

inputs = torch.randn((1024))
o1 = lm_head(inputs)
o1.mean().backward()

# Without the fix, the stale hook made the wrapper call the previous forward,
# so gradients did not reach the trainable copy and the checks below failed.
assert modules_to_save.weight.requires_grad is True
assert original_module.weight.grad is None
assert modules_to_save.weight.grad is not None

The fix is to create a fresh copy of the previous hook that keeps the same attributes, remove the old hook, and attach the new copy to the module; see the sketch below.
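
For illustration, here is a minimal sketch of that approach (not the exact diff in this PR; the helper name `_recreate_and_replace_hook` is made up, and it assumes accelerate stores its hook on the module's `_hf_hook` attribute and exposes `add_hook_to_module` / `remove_hook_from_module` in `accelerate.hooks`):

import inspect

import accelerate
from accelerate.hooks import add_hook_to_module, remove_hook_from_module


def _recreate_and_replace_hook(module):
    """Sketch only: rebuild the accelerate hook on `module` so it wraps the current forward."""
    old_hook = getattr(module, "_hf_hook", None)
    if old_hook is None:
        return  # no accelerate hook attached, nothing to do

    # Re-instantiate the same hook class, copying over only the attributes
    # that its __init__ accepts, so the new hook keeps the old configuration.
    hook_cls = getattr(accelerate.hooks, old_hook.__class__.__name__)
    accepted = inspect.signature(hook_cls.__init__).parameters
    kwargs = {k: v for k, v in old_hook.__dict__.items() if k in accepted}
    new_hook = hook_cls(**kwargs)

    # Detach the stale hook (which still wraps the pre-copy forward)
    # and attach the fresh copy to the module.
    remove_hook_from_module(module)
    add_hook_to_module(module, new_hook)

Re-attaching a freshly built hook means the wrapped forward now points at the copied module's own forward, which is what lets gradients flow to the trainable copy in modules_to_save.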

cc @BenjaminBossan @pacman100

@HuggingFaceDocBuilderDev commented on Jul 26, 2023

The documentation is not available anymore as the PR was closed or merged.

@pacman100 (Contributor) left a comment

Thank you @BenjaminBossan for the fixes, LGTM! 🚀

@BenjaminBossan (Member) left a comment

LGTM too, thanks for the PR. Let's wait and see whether the accelerate devs have any concerns before merging.

@require_torch_gpu
@pytest.mark.single_gpu_tests
@require_bitsandbytes
def test_modules_to_save_grad(self):

@BenjaminBossan (Member) commented on this test:

Do we want to throw in the device_map="auto" option too?

@BenjaminBossan (Member), in a follow-up:

Okay. I was just thinking about the users reporting that the issue also occurs with device_map="auto" and without quantization, so we could cover that too. But it's not super important.
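
For reference, a hypothetical parametrized variant of the test could cover both cases. This is only a sketch based on the reproduction script above; the test name, the parameter grid, and the omission of the GPU/bitsandbytes decorators are illustrative, not what this PR adds.

import pytest
import torch
from peft import LoraConfig, TaskType, get_peft_model
from transformers import AutoModelForSequenceClassification


# Hypothetical sketch: exercise the gradient check both with 4-bit quantization
# and with plain device_map="auto" (no quantization), as discussed above.
@pytest.mark.parametrize(
    "load_in_4bit, device_map",
    [(True, None), (False, "auto")],
)
def test_modules_to_save_grad_variants(load_in_4bit, device_map):
    model = AutoModelForSequenceClassification.from_pretrained(
        "bigscience/bloomz-560m",
        load_in_4bit=load_in_4bit,
        torch_dtype=torch.float32,
        device_map=device_map,
    )
    config = LoraConfig(r=16, lora_alpha=16, task_type=TaskType.SEQ_CLS)
    peft_model = get_peft_model(model, config)

    lm_head = peft_model.base_model.model.score
    device = lm_head.modules_to_save.default.weight.device
    output = lm_head(torch.randn(1024).to(device))
    output.mean().backward()

    # Gradients must flow to the trainable copy, not the frozen original module.
    assert lm_head.original_module.weight.grad is None
    assert lm_head.modules_to_save.default.weight.grad is not None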

Successfully merging this pull request may close these issues:

modules_to_save incompatible with load_in_4bit / load_in_8bit ?
4 participants