
[common] merge ChatGLM2Attention::forward into Attention::forward #86

Merged 2 commits into intel:main on Dec 1, 2023

Conversation

a3213105
Contributor

Add an epsilon param to LayerNorm to align with RmsNorm and unify the Norm APIs (LayerNorm doesn't use this param); a sketch of the aligned signatures follows below.

Extend qk_shape from 4 to 5 in attention.h to pass key_head_num, which rotary_embedding_chatglm2 needs for multi-query attention; see the second sketch below.

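A minimal sketch of what the aligned Norm signatures could look like. The class layouts, parameter order, and default epsilon values here are illustrative assumptions, not the actual xFasterTransformer code:

```cpp
// Minimal sketch, assuming hypothetical class/method shapes -- this is NOT the
// exact xFasterTransformer API, only an illustration of the aligned signatures.
#include <cmath>
#include <vector>

struct RmsNorm {
    // RmsNorm genuinely uses epsilon in the normalization denominator.
    static void forward(float *out, const float *in, const float *weight,
                        int rows, int cols, float epsilon = 1e-6f) {
        for (int r = 0; r < rows; ++r) {
            float ss = 0.f;
            for (int c = 0; c < cols; ++c) ss += in[r * cols + c] * in[r * cols + c];
            const float scale = 1.f / std::sqrt(ss / cols + epsilon);
            for (int c = 0; c < cols; ++c)
                out[r * cols + c] = in[r * cols + c] * scale * weight[c];
        }
    }
};

struct LayerNorm {
    // epsilon is accepted only so both norms share one calling convention;
    // per the PR description, LayerNorm ignores it (a fixed eps is used inside).
    static void forward(float *out, const float *in, const float *gamma,
                        const float *beta, int rows, int cols,
                        float /*epsilon*/ = 1e-5f) {
        for (int r = 0; r < rows; ++r) {
            float mean = 0.f, var = 0.f;
            for (int c = 0; c < cols; ++c) mean += in[r * cols + c];
            mean /= cols;
            for (int c = 0; c < cols; ++c) {
                const float d = in[r * cols + c] - mean;
                var += d * d;
            }
            const float rstd = 1.f / std::sqrt(var / cols + 1e-5f);
            for (int c = 0; c < cols; ++c)
                out[r * cols + c] = (in[r * cols + c] - mean) * rstd * gamma[c] + beta[c];
        }
    }
};

int main() {
    std::vector<float> in(4, 1.f), out(4), w(4, 1.f), b(4, 0.f);
    // Call shapes are now identical apart from LayerNorm's extra beta.
    RmsNorm::forward(out.data(), in.data(), w.data(), 1, 4, 1e-6f);
    LayerNorm::forward(out.data(), in.data(), w.data(), b.data(), 1, 4, 1e-6f);
    return 0;
}
```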
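And a sketch of the extended qk_shape. The real rotary_embedding_chatglm2 signature and the exact element order of qk_shape in attention.h are assumptions here; the point is only that the 5th entry carries key_head_num so the kernel can handle multi-query attention, where the key/value head count differs from the query head count:

```cpp
// Illustrative sketch only: the real rotary_embedding_chatglm2 signature and
// the exact qk_shape element order are assumptions, not the library's API.
#include <cstdio>

void rotary_embedding_chatglm2(const int *qk_shape) {
    const int batch    = qk_shape[0];
    const int seq_len  = qk_shape[1];
    const int q_heads  = qk_shape[2];
    const int head_dim = qk_shape[3];
    const int kv_heads = qk_shape[4]; // new 5th element: key_head_num
    std::printf("batch=%d seq=%d q_heads=%d head_dim=%d key_heads=%d\n",
                batch, seq_len, q_heads, head_dim, kv_heads);
    // ... apply RoPE to Q ([batch, seq, q_heads, head_dim]) and
    //     K ([batch, seq, kv_heads, head_dim]) here ...
}

int main() {
    // Before this PR qk_shape had 4 entries; the 5th now passes key_head_num.
    int qk_shape[5] = {1, 128, 32, 128, 2}; // e.g. 32 query heads, 2 KV heads
    rotary_embedding_chatglm2(qk_shape);
    return 0;
}
```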
On Nov 27, 2023, a3213105 changed the title from "merge ChatGLM2Attention::forward into Attention::forward" to "[common] merge ChatGLM2Attention::forward into Attention::forward".
@Duyi-Wang
Contributor

Please rebase onto main and fix the conflicts.

@a3213105
Contributor Author

Please rebase onto main and fix the conflicts.

done

Duyi-Wang merged commit 2830926 into intel:main on Dec 1, 2023
1 check passed
abenmao pushed a commit to abenmao/xFasterTransformer that referenced this pull request on Dec 4, 2023.