[TRANSFORMATIONS][GPU] SDPA Fusion passes #28042
base: master
Force-pushed from 44480fe to 792a90f
auto scaled_q = makePattern<ov::op::v1::Multiply>({q, scale_q});
auto scaled_k = makePattern<ov::op::v1::Multiply>({k, scale_k});
Wouldn't it make sense to make one of these Multiply ops optional?
Relaxed the pattern restrictions to support optional scales and the non-constant scale case.
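For readers following the thread, here is a minimal, self-contained sketch of why either scale can be treated as optional: scaling q and k separately gives the same dot product as applying the product of the two scales once, with a missing scale acting as 1. This is plain C++ and not OpenVINO code; `dot`, `scaled_dot_unfused`, and `scaled_dot_fused` are hypothetical names used only for illustration, and how the pass itself combines the scales is not shown in this thread.

```cpp
#include <cstddef>
#include <optional>
#include <vector>

// Hypothetical helper (not an OpenVINO API): dot product of two equal-length vectors.
float dot(const std::vector<float>& a, const std::vector<float>& b) {
    float acc = 0.0f;
    for (std::size_t i = 0; i < a.size(); ++i) {
        acc += a[i] * b[i];
    }
    return acc;
}

// Unfused form: scale q and k separately (either scale may be absent),
// then take the dot product, mirroring the two Multiply nodes in the pattern.
float scaled_dot_unfused(std::vector<float> q, std::vector<float> k,
                         std::optional<float> scale_q, std::optional<float> scale_k) {
    if (scale_q) {
        for (float& v : q) v *= *scale_q;
    }
    if (scale_k) {
        for (float& v : k) v *= *scale_k;
    }
    return dot(q, k);
}

// Fused form: a single scale applied to the q.k product.
// A missing scale is treated as 1, so both Multiply nodes are optional.
float scaled_dot_fused(const std::vector<float>& q, const std::vector<float>& k,
                       std::optional<float> scale_q, std::optional<float> scale_k) {
    const float scale = scale_q.value_or(1.0f) * scale_k.value_or(1.0f);
    return dot(q, k) * scale;
}
```

Calling both functions with only `scale_q` set, only `scale_k` set, both set, or neither set should give matching results up to floating-point rounding.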
Force-pushed from 792a90f to 54a10c4
pattern_map.at(softmax).get_target_inputs().size() > 1) {
return false;
}
if (pattern_map.count(optional_add_mask) && (pattern_map.at(optional_add_mask).get_target_inputs().size() > 1 ||
Why do we need this restriction on the number of mask consumers?
Those aren't the mask's consumers; they're the consumers of the Add op.
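To illustrate the distinction, a toy sketch of the single-consumer rule follows. It assumes the usual reasoning behind such checks: an intermediate node (here the Add) can only be absorbed into a fused op if nothing outside the matched subgraph reads its output. `ToyNode` and `can_absorb` are hypothetical and only mimic the `get_target_inputs().size() > 1` check quoted above; they are not OpenVINO types.

```cpp
#include <string>
#include <vector>

// Hypothetical toy graph node; it only models "who reads my output".
struct ToyNode {
    std::string name;
    std::vector<const ToyNode*> consumers;  // nodes that consume this node's output
};

// An intermediate node may be absorbed into a fused op only when the fused
// subgraph is its sole consumer. With a second consumer, removing the node
// would leave that consumer without its input.
bool can_absorb(const ToyNode& intermediate) {
    return intermediate.consumers.size() <= 1;
}
```

Note that the mask itself may feed any number of other nodes; the restriction in this sketch applies only to the output of the intermediate Add.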
Signed-off-by: Vladimir Paramuzov <[email protected]>
Force-pushed from 54a10c4 to 83ccfe3
Details:
T5 case