Handle .to for 4bit quantized models #186

Merged
merged 1 commit into from
May 25, 2023

Conversation

g8a9
Contributor

@g8a9 g8a9 commented May 24, 2023

Description

This PR extends the check that prevents calling .to on 8bit quantized HF models so that it also covers models loaded with 4bit quantization.
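As a rough illustration of the kind of guard described above (not the actual diff in this PR), the sketch below skips `.to` when a model reports that it was loaded in 8bit or 4bit. It assumes the model exposes the boolean flags `is_loaded_in_8bit` and `is_loaded_in_4bit`; the helper names `is_quantized` and `move_if_allowed` and the `DummyModel` class are hypothetical and exist only for this example.

```python
class DummyModel:
    """Hypothetical stand-in for a loaded model; only tracks its device."""

    def __init__(self, **flags):
        self.device = "cpu"
        # Flags such as is_loaded_in_8bit / is_loaded_in_4bit are assumed
        # to be plain boolean attributes on the model object.
        for name, value in flags.items():
            setattr(self, name, value)

    def to(self, device):
        self.device = device
        return self


def is_quantized(model) -> bool:
    """Return True if the model reports 8bit or 4bit quantized loading."""
    return bool(
        getattr(model, "is_loaded_in_8bit", False)
        or getattr(model, "is_loaded_in_4bit", False)
    )


def move_if_allowed(model, device):
    """Call model.to(device) unless the model is quantized.

    Quantized models are returned untouched, so they stay on whatever
    device they were placed on at load time.
    """
    if is_quantized(model):
        return model
    return model.to(device)


full_precision = move_if_allowed(DummyModel(), "cuda:0")
eight_bit = move_if_allowed(DummyModel(is_loaded_in_8bit=True), "cuda:0")
four_bit = move_if_allowed(DummyModel(is_loaded_in_4bit=True), "cuda:0")
```

Reading the flags with `getattr` and a `False` default keeps the check safe for models that do not define either attribute.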

Type of Change

  • 🥂 Improvement (non-breaking change which improves an existing feature)

Member

@gsarti gsarti left a comment


LGTM, thanks for the timely fix!

@gsarti gsarti merged commit e3b1f59 into inseq-team:main May 25, 2023
@gsarti gsarti added this to the v0.5 milestone Jul 21, 2023