[modeling_utils] torch_dtype/auto floating dtype fixes #17614

stas00 · 2022-06-08T17:45:20Z

As reported in #17583 not all model's have their first param of floating dtype, which lead to failures like:

$ python -c 'from transformers import AutoModel; AutoModel.from_pretrained("hf-internal-testing/tiny-bert-for-token-classification", torch_dtype="auto")'
Traceback (most recent call last):
  File "<string>", line 1, in <module>
  File "/mnt/nvme0/code/huggingface/transformers-master/src/transformers/models/auto/auto_factory.py", line 446, in from_pretrained
    return model_class.from_pretrained(pretrained_model_name_or_path, *model_args, config=config, **kwargs)
  File "/mnt/nvme0/code/huggingface/transformers-master/src/transformers/modeling_utils.py", line 2004, in from_pretrained
    dtype_orig = cls._set_default_torch_dtype(torch_dtype)
  File "/mnt/nvme0/code/huggingface/transformers-master/src/transformers/modeling_utils.py", line 980, in _set_default_torch_dtype
    raise ValueError(
ValueError: Can't instantiate BertModel model under dtype=torch.int64 since it is not a floating point dtype

This PR fixes that by searching for the first floating dtype instead.
adds test that failed before this PR

Fixes: #17583

Possible additional TODO that wasn't part of the original report

@sgugger, we can sort out the saving side of things here as well - I already added an alternative get_parameter_dtype => get_parameter_first_float_dtype - but I wanted to check in with you if we replace all instances of get_parameter_dtype or only some.

I didn't go ahead with doing that since we have a method called dtype which probably should call get_parameter_dtype and add float_dtype? Not sure - let's see what you think is the best way to proceed.

HuggingFaceDocBuilderDev · 2022-06-08T17:55:06Z

The documentation is not available anymore as the PR was closed or merged.

sgugger

I wouldn't start throwing errors in those function but would return the last dtype in case everything is an int (for the unlikely case we get a quantized model).

Then I'd use this new get_parameter_first_float_dtype instead of the next parameter hack (for instance when we set the self.config.torch_dtype.

Thanks a lot for working on this!

src/transformers/modeling_utils.py

tests/test_modeling_common.py

src/transformers/modeling_utils.py

stas00 · 2022-06-09T16:42:54Z

Probably good to merge now, right?

sgugger

YEs, good for me if it's good for you :-)

…7614) * [modeling_utils] torch_dtype/auto fixes * add test * apply suggestions * add missing fallback * Renaming things * Use for else Co-authored-by: Sylvain Gugger <[email protected]>

stas00 added 2 commits June 8, 2022 10:32

[modeling_utils] torch_dtype/auto fixes

5940b20

add test

b9fc083

sgugger reviewed Jun 8, 2022

View reviewed changes

src/transformers/modeling_utils.py Show resolved Hide resolved

src/transformers/modeling_utils.py Outdated Show resolved Hide resolved

src/transformers/modeling_utils.py Show resolved Hide resolved

tests/test_modeling_common.py Show resolved Hide resolved

apply suggestions

c139f8c

sgugger reviewed Jun 8, 2022

View reviewed changes

src/transformers/modeling_utils.py Outdated Show resolved Hide resolved

sgugger reviewed Jun 8, 2022

View reviewed changes

src/transformers/modeling_utils.py Show resolved Hide resolved

sgugger reviewed Jun 8, 2022

View reviewed changes

src/transformers/modeling_utils.py Outdated Show resolved Hide resolved

stas00 and others added 2 commits June 8, 2022 12:18

add missing fallback

421cb4d

Renaming things

86484b1

stas00 commented Jun 8, 2022

View reviewed changes

src/transformers/modeling_utils.py Outdated Show resolved Hide resolved

Use for else

745895a

sgugger approved these changes Jun 9, 2022

View reviewed changes

stas00 merged commit 75343de into main Jun 9, 2022

stas00 deleted the torch_dtype_auto2 branch June 9, 2022 17:18

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[modeling_utils] torch_dtype/auto floating dtype fixes #17614

[modeling_utils] torch_dtype/auto floating dtype fixes #17614

stas00 commented Jun 8, 2022 •

edited

Loading

HuggingFaceDocBuilderDev commented Jun 8, 2022 •

edited

Loading

sgugger left a comment •

edited

Loading

stas00 commented Jun 9, 2022

sgugger left a comment

[modeling_utils] torch_dtype/auto floating dtype fixes #17614

[modeling_utils] torch_dtype/auto floating dtype fixes #17614

Conversation

stas00 commented Jun 8, 2022 • edited Loading

HuggingFaceDocBuilderDev commented Jun 8, 2022 • edited Loading

sgugger left a comment • edited Loading

Choose a reason for hiding this comment

stas00 commented Jun 9, 2022

sgugger left a comment

Choose a reason for hiding this comment

stas00 commented Jun 8, 2022 •

edited

Loading

HuggingFaceDocBuilderDev commented Jun 8, 2022 •

edited

Loading

sgugger left a comment •

edited

Loading