[GPU] Integrate dynamic quantization for onednn #26940

Merged: 29 commits, Dec 9, 2024

@isanghao (Contributor) commented Oct 8, 2024

Details:

  • Integrated grouped dynamic quantization from onednn
  • Integrated asymmetric per-token dynamic quantization from onednn
  • Neither is enabled by default yet

Tickets:

  • 148732, 157869, 157589
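
For readers unfamiliar with the two schemes named in the details above, here is a minimal pure-Python sketch of what "grouped" and "asymmetric per-token" dynamic quantization generally mean. It is not the onednn or GPU plugin code from this PR: the function names, the int8/uint8 ranges, and the choice of max-abs and min/max statistics are illustrative assumptions.

```python
def quantize_grouped_symmetric(values, group_size):
    """Quantize to int8 with one scale per group of `group_size` elements."""
    quantized, scales = [], []
    for start in range(0, len(values), group_size):
        group = values[start:start + group_size]
        max_abs = max(abs(v) for v in group)
        # The scale comes from the data seen at run time ("dynamic").
        scale = max_abs / 127.0 if max_abs > 0 else 1.0
        scales.append(scale)
        quantized.extend(max(-127, min(127, round(v / scale))) for v in group)
    return quantized, scales


def quantize_per_token_asymmetric(token):
    """Quantize one token (row) to uint8 with a single scale and zero point."""
    # Extend the range to include 0.0 so the zero point fits in uint8.
    lo, hi = min(min(token), 0.0), max(max(token), 0.0)
    scale = (hi - lo) / 255.0 if hi > lo else 1.0
    zero_point = round(-lo / scale)
    quantized = [max(0, min(255, round(v / scale) + zero_point)) for v in token]
    return quantized, scale, zero_point


def dequantize(quantized, scale, zero_point=0):
    """Map quantized integers back to approximate float values."""
    return [(q - zero_point) * scale for q in quantized]


# Hypothetical activation row for one token.
token = [-1.0, 0.25, 0.5, 3.0]
q_sym, group_scales = quantize_grouped_symmetric(token, group_size=2)
q_asym, scale, zero_point = quantize_per_token_asymmetric(token)
```

In this sketch, the grouped variant keeps a separate scale for each slice of the row, which limits the effect of a single outlier to its own group. The asymmetric variant stores a zero point alongside the scale so that a range not centered on zero still uses the full uint8 span.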

github-actions bot (Contributor) commented Dec 2, 2024

This PR will be closed in a week because of 2 weeks of no activity.

@github-actions bot added the Stale label Dec 2, 2024
@isanghao removed the Stale label Dec 2, 2024
@isanghao marked this pull request as ready for review December 2, 2024 09:21
@isanghao requested review from a team as code owners December 2, 2024 09:21
@isanghao changed the title from "[GPU] grouped dynamic quantization for onednn" to "[GPU] Integrate dynamic quantization for onednn" Dec 4, 2024
@isanghao added this pull request to the merge queue Dec 6, 2024
@github-merge-queue bot removed this pull request from the merge queue due to failed status checks Dec 6, 2024
@isanghao added this pull request to the merge queue Dec 9, 2024
github-merge-queue bot pushed a commit that referenced this pull request Dec 9, 2024
@github-merge-queue bot removed this pull request from the merge queue due to failed status checks Dec 9, 2024
@isanghao added this pull request to the merge queue Dec 9, 2024
Merged via the queue into openvinotoolkit:master with commit b840082 Dec 9, 2024
155 checks passed
@isanghao deleted the dyn_quan_gs branch December 9, 2024 06:59
11happy pushed a commit to 11happy/openvino that referenced this pull request Dec 23, 2024
Labels: category: GPU (OpenVINO GPU plugin)
2 participants