Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Add mul absorbing for smooth quant of onnxrt adaptor #807

Merged
merged 24 commits into from
Apr 28, 2023
Merged

Conversation

mengniwang95
Copy link
Contributor

Type of Change

feature

Description

Add mul absorbing function for smooth quant of onnxrt adaptor

Expected Behavior & Potential Risk

UT pass

How has this PR been tested?

UT, local LLM test

Dependency Change?

no

@mengniwang95
Copy link
Contributor Author

mengniwang95 commented Apr 18, 2023

@chensuyue
Copy link
Contributor

extension test

@chensuyue
Copy link
Contributor

SmoothQuant Vs Common Quant report

@chensuyue chensuyue merged commit 3df6478 into master Apr 28, 2023
@chensuyue chensuyue deleted the mengni/sq branch April 28, 2023 09:15
yiliu30 pushed a commit to yiliu30/neural-compressor that referenced this pull request Feb 28, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

Successfully merging this pull request may close these issues.

4 participants