[CINN]Add qkv unpack attn #64439
Conversation
Your PR has been submitted successfully. Thank you for your contribution to the open-source project!
… add_qkv_unpack_attn
LGTM
```diff
@@ -2514,6 +2514,16 @@
   backward : put_along_axis_grad
   interfaces : paddle::dialect::InferSymbolicShapeInterface

+- op : qkv_unpack_mha
```
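The hunk only shows the first line of the new entry (the header indicates ten added lines). For orientation, entries in this YAML follow a fixed schema, as the neighboring put_along_axis fields suggest; the field values below are illustrative assumptions, not the PR's actual registration:

```yaml
# Illustrative sketch only: the args/output/infer_meta/kernel values
# for qkv_unpack_mha are assumptions, not taken from the PR.
- op : qkv_unpack_mha
  args : (Tensor q, Tensor k, Tensor v)
  output : Tensor(out)
  infer_meta :
    func : QkvUnpackMhaInferMeta   # hypothetical InferMeta function name
  kernel :
    func : qkv_unpack_mha
    data_type : q
```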
This looks like a fusion op; should it be moved into fused_ops.yaml?
TODO: move this operator into fusion_ops.yaml.
* add fast infer attention
* remove useless code
* fix ROCm compile bug
* polish code
* fix conflict
* remove duplicate
PR Category
CINN
PR Types
Others
Description
pcard-76996
Add an attention op that takes unpacked q/k/v inputs:
q = [1, 1, head * head_dim]
k = [1, seq_len, head * head_dim]
v = [1, seq_len, head * head_dim]
Currently only the bs = 1 case is supported; support will be extended incrementally in follow-up work.
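For reference, here is a minimal numpy sketch of the computation such a qkv-unpack attention performs for the shapes above. The function name, the num_heads argument, and the absence of masking are assumptions for illustration; this is not the PR's CINN/CUDA kernel.

```python
import numpy as np

def qkv_unpack_attention(q, k, v, num_heads):
    # q: [1, 1, num_heads * head_dim]        (single query token, bs = 1)
    # k: [1, seq_len, num_heads * head_dim]
    # v: [1, seq_len, num_heads * head_dim]
    bs, q_len, hidden = q.shape
    _, seq_len, _ = k.shape
    head_dim = hidden // num_heads

    # Split the packed hidden dim into heads: [bs, heads, len, head_dim].
    def split_heads(x, length):
        return x.reshape(bs, length, num_heads, head_dim).transpose(0, 2, 1, 3)

    qh = split_heads(q, q_len)
    kh = split_heads(k, seq_len)
    vh = split_heads(v, seq_len)

    # Scaled dot-product attention per head.
    scores = qh @ kh.transpose(0, 1, 3, 2) / np.sqrt(head_dim)  # [bs, heads, q_len, seq_len]
    probs = np.exp(scores - scores.max(axis=-1, keepdims=True))
    probs /= probs.sum(axis=-1, keepdims=True)                  # softmax over seq_len
    out = probs @ vh                                            # [bs, heads, q_len, head_dim]

    # Merge heads back to [bs, q_len, heads * head_dim].
    return out.transpose(0, 2, 1, 3).reshape(bs, q_len, hidden)

# Example with hypothetical sizes: 8 heads of dim 64, seq_len = 128.
q = np.random.randn(1, 1, 8 * 64).astype("float32")
k = np.random.randn(1, 128, 8 * 64).astype("float32")
v = np.random.randn(1, 128, 8 * 64).astype("float32")
out = qkv_unpack_attention(q, k, v, num_heads=8)  # -> shape (1, 1, 512)
```

The single-token query shape ([1, 1, head * head_dim]) matches the decode step of autoregressive inference, which is presumably why the op is described as "fast infer attention" in the commit list.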