[GPU] Force sdpa use onednn path for prefill and cl path for generation. #27387

Merged

ceciliapeng2011
Contributor

@ceciliapeng2011 ceciliapeng2011 commented Nov 4, 2024

Details:

  • [GPU] Force SDPA to use the oneDNN path for prefill and the clDNN path for generation on the ARL-H platform
  • ...

Tickets:

@ceciliapeng2011 ceciliapeng2011 requested review from a team as code owners November 4, 2024 12:32
@github-actions github-actions bot added the category: GPU OpenVINO GPU plugin label Nov 4, 2024
@ceciliapeng2011 ceciliapeng2011 added do not merge and removed category: GPU OpenVINO GPU plugin labels Nov 4, 2024
@github-actions github-actions bot added the category: GPU OpenVINO GPU plugin label Nov 5, 2024
Contributor

@sshlyapn sshlyapn left a comment

Looks good to me, left minor comments

@sshlyapn sshlyapn changed the title force sdpa use onednn path for prefill and cl path for generation. [GPU] Force sdpa use onednn path for prefill and cl path for generation. Nov 29, 2024
@p-durandin p-durandin added Code Freeze backport label to backport PR to the release branch labels Nov 29, 2024
@yeonbok
Contributor

yeonbok commented Dec 4, 2024

@ceciliapeng2011 LGTM, but is this only effective on ARLH? How about LNL?

@p-durandin p-durandin added this pull request to the merge queue Dec 4, 2024
github-merge-queue bot pushed a commit that referenced this pull request Dec 4, 2024
#27889)

…on... the backport of pull/27387.

### Details:
- *[GPU] Force SDPA use oneDNN path for prefill and clDNN path for generation on ARL-H platform*
- *backport of [pull/27387](#27387)*

### Tickets:
 - *[CVS-158461](https://jira.devtools.intel.com/browse/CVS-158461)*
Merged via the queue into openvinotoolkit:master with commit 571e98d Dec 4, 2024
155 checks passed
@ceciliapeng2011
Contributor Author

> @ceciliapeng2011 LGTM, but is this only effective on ARLH? How about LNL?

Yes, it is only for ARL-H. We also benchmarked on LNL and ARC, but found no strong justification for applying it there: the performance varies across shapes and models.
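
For readers following the thread, the selection rule described in this PR can be sketched roughly as below. This is an illustrative sketch only, not the code merged in the PR: the names `GpuPlatform`, `SdpaImpl`, `is_prefill`, and `select_sdpa_impl` are hypothetical, and it assumes prefill can be recognized by a query sequence length greater than one, which the conversation does not state.

```cpp
// Hypothetical sketch: none of these names come from the OpenVINO sources.
#include <cstddef>

enum class GpuPlatform { ARL_H, LNL, ARC, Other };

// Default means "leave the plugin's usual implementation choice untouched".
enum class SdpaImpl { OneDNN, ClDNN, Default };

// Assumption: prefill processes the whole prompt at once (query length > 1),
// while generation processes a single new token per step (query length == 1).
inline bool is_prefill(std::size_t query_seq_len) {
    return query_seq_len > 1;
}

// Force the SDPA implementation only on ARL-H; other platforms keep the default,
// matching the author's note that LNL and ARC showed no consistent benefit.
inline SdpaImpl select_sdpa_impl(GpuPlatform platform, std::size_t query_seq_len) {
    if (platform != GpuPlatform::ARL_H) {
        return SdpaImpl::Default;
    }
    return is_prefill(query_seq_len) ? SdpaImpl::OneDNN : SdpaImpl::ClDNN;
}
```

Under these assumptions, `select_sdpa_impl(GpuPlatform::ARL_H, 128)` picks the oneDNN path, `select_sdpa_impl(GpuPlatform::ARL_H, 1)` picks the clDNN path, and any LNL or ARC call returns `SdpaImpl::Default`.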
