Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[GPU] Disable KV cache compression and FC scaling for GPU with systolic #27988

Merged
merged 3 commits into from
Dec 11, 2024

Conversation

isanghao
Copy link
Contributor

Details:

  • GPU with systolic does not need FC scaling
  • GPU with systolic does not support kv cache compresssion yet

@isanghao isanghao requested review from a team as code owners December 10, 2024 07:16
@github-actions github-actions bot added the category: GPU OpenVINO GPU plugin label Dec 10, 2024
@vladimir-paramuzov vladimir-paramuzov added this to the 2025.0 milestone Dec 10, 2024
@isanghao isanghao added this pull request to the merge queue Dec 11, 2024
@github-merge-queue github-merge-queue bot removed this pull request from the merge queue due to failed status checks Dec 11, 2024
@isanghao isanghao added this pull request to the merge queue Dec 11, 2024
Merged via the queue into openvinotoolkit:master with commit 859958f Dec 11, 2024
158 checks passed
@isanghao isanghao deleted the systolic_gpu_fix branch December 11, 2024 10:23
11happy pushed a commit to 11happy/openvino that referenced this pull request Dec 23, 2024
…ic (openvinotoolkit#27988)

### Details:
- GPU with systolic does not need FC scaling
- GPU with systolic does not support kv cache compresssion yet
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
category: GPU OpenVINO GPU plugin
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants