FEAT: Add EETQ support in PEFT #1675
Conversation
The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.
Thanks a lot for adding EETQ support. This implementation looks really smooth, nice!
Could you please add an entry to the quantization docs of PEFT? Maybe mention some pros/cons of EETQ compared to the other quantization methods, or add a reference to where this is better explained (their README wasn't very helpful in this regard).
Did you see the CI error:
Error: The version '3.9' with architecture 'arm64' was not found for macOS 14.4.1.
Any idea what that is about?
```python
if is_eetq_available():
    from eetq import EetqLinear
```
Should we have a lazy import as for bnb, or is it not necessary for EETQ?
Makes sense!
What I mean is: should we indent all the code below to be inside of `if is_eetq_available():`? Or is it not necessary because, unlike bnb, EETQ does not initialize CUDA?
I think it does, so let's indent it to be on the safe side.
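For reference, a minimal sketch of the guarded-import pattern being discussed, assuming `is_eetq_available` lives in `peft.import_utils` and the LoRA base class is `peft.tuners.lora.layer.LoraLayer`; the class body is just a stub, the real layer in the PR is more involved:

```python
import torch

from peft.import_utils import is_eetq_available
from peft.tuners.lora.layer import LoraLayer

if is_eetq_available():
    # Everything that touches eetq is defined inside this guard, so importing
    # this module neither fails nor initializes CUDA when eetq is absent.
    from eetq import EetqLinear  # quantized base layer the LoRA wrapper builds on

    class EetqLoraLinear(torch.nn.Module, LoraLayer):
        # Stub: adapter setup and the quantized forward pass are omitted here.
        pass
```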
src/peft/tuners/lora/eetq.py (Outdated)
```python
self._active_adapter = adapter_name
self.update_layer(adapter_name, r, lora_alpha, lora_dropout, init_lora_weights, use_rslora)
```
Would merging currently work with EETQ? I would assume not. Maybe we can raise an error when users try it?
src/peft/tuners/lora/eetq.py (Outdated)
```python
return result

def merge(self, safe_merge: bool = False, adapter_names: Optional[list[str]] = None) -> None:
    raise ValueError("Merging LoRA layers is not supported for Eetq layers.")
```
Let's also add `unmerge` for completeness. I also wonder if `ValueError` is best or if it should be `TypeError` (maybe `AttributeError`?).
Makes sense!
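A sketch of what the resolved `merge`/`unmerge` stubs could look like; `AttributeError` is just one of the options floated above, and the simplified class here stands in for the actual EETQ LoRA layer:

```python
from typing import Optional

import torch


class EetqLoraLinear(torch.nn.Module):  # simplified stand-in for the real EETQ LoRA layer
    def merge(self, safe_merge: bool = False, adapter_names: Optional[list[str]] = None) -> None:
        # EETQ keeps the base weights in a packed int8 format, so the LoRA delta
        # cannot be folded back into them; raise instead of silently doing nothing.
        raise AttributeError("Merging LoRA layers is not supported for Eetq layers.")

    def unmerge(self) -> None:
        raise AttributeError("Unmerging LoRA layers is not supported for Eetq layers.")
```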
Thanks so much for adding EETQ, LGTM. I have 2 nits regarding the documentation, up to you if you want to fix them.
```python
import torch
from transformers import EetqConfig

config = EetqConfig("int8")
```
This probably requires the latest transformers, right? Maybe worth adding the min version?
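One way to make the requirement explicit in the docs would be a version check like the sketch below; the exact minimum release that ships `EetqConfig` is an assumption here and should be verified against the transformers changelog:

```python
import importlib.metadata

from packaging import version

# Assumed minimum transformers version providing EetqConfig; verify against the
# transformers release notes before relying on this exact number.
MIN_TRANSFORMERS_VERSION = "4.40.0"

installed = version.parse(importlib.metadata.version("transformers"))
if installed < version.parse(MIN_TRANSFORMERS_VERSION):
    raise ImportError(
        f"EETQ quantization via EetqConfig requires transformers>={MIN_TRANSFORMERS_VERSION}, "
        f"but {installed} is installed."
    )
```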
Co-authored-by: Benjamin Bossan <[email protected]>
What does this PR do?
This PR adds support for EETQ quantized linear layers in PEFT.
EETQ was recently added to transformers and offers fast 8-bit quantized inference: huggingface/transformers#30262
Fixes: #1643
Learn more about EETQ here: https://github.com/NetEase-FuXi/EETQ
TODO
cc @SunMarc @BenjaminBossan @pacman100
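For context, a minimal end-to-end sketch of what this support enables: quantize a base model with EETQ through transformers, then attach a LoRA adapter with PEFT. The model id and LoRA hyperparameters below are placeholders, and the snippet assumes eetq, a recent transformers, and this PR's version of peft are installed:

```python
import torch
from peft import LoraConfig, get_peft_model
from transformers import AutoModelForCausalLM, EetqConfig

# Quantize the base model to int8 with EETQ at load time.
quant_config = EetqConfig("int8")
model = AutoModelForCausalLM.from_pretrained(
    "facebook/opt-125m",  # placeholder model id
    quantization_config=quant_config,
    torch_dtype=torch.float16,
    device_map="auto",
)

# Attach a LoRA adapter on top of the EETQ-quantized linear layers.
lora_config = LoraConfig(
    r=8,
    lora_alpha=16,
    target_modules=["q_proj", "v_proj"],  # placeholder target modules
    lora_dropout=0.05,
    task_type="CAUSAL_LM",
)
model = get_peft_model(model, lora_config)
model.print_trainable_parameters()
```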