Commit

fix falcon-40b accuracy issue (microsoft#4895)
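[PR](microsoft#4721) added the entry "DecoderLayer": 'glmtype'.
Because that key also matches Falcon's decoder layer name, the Falcon
model was assigned the 'glmtype' fused_qkv_type, which appears to be the
source of the falcon-40b accuracy issue. The Falcon model (including
FalconDecoderLayer) needs to select 'bloomtype' explicitly, so this
commit adds a dedicated "FalconDecoderLayer": 'bloomtype' entry.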

Co-authored-by: Michael Wyatt <[email protected]>
Yejing-Lai and mrwyattii authored Jan 10, 2024
1 parent 43eba77 commit 16c265c
Showing 1 changed file with 2 additions and 1 deletion.
3 changes: 2 additions & 1 deletion deepspeed/module_inject/fusedqkv_utils.py
@@ -38,7 +38,8 @@ def prepare_tp_fused_qkvw(module_str, src, mp_size, gpu_index):
         "MptBlock": 'glmtype',
         "BaichuanLayer": 'glmtype',
         "DecoderLayer": 'glmtype',
-        "GPTBigCodeBlock": 'bigcodetype'
+        "FalconDecoderLayer": 'bloomtype',
+        "GPTBigCodeBlock": 'bigcodetype',
     }
 
     def _codegen_type_transpose(input, mp_size, codegen_mp_num=4):
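
To illustrate why the generic "DecoderLayer" key could capture Falcon, here is a minimal sketch of a substring-based lookup. The diff does not show how the table is consulted, so the helper `resolve_fused_type`, the constant `FUSED_TYPE_DICT`, and the longest-match rule are assumptions made for illustration only, not DeepSpeed's actual code. Only the table entries are taken from the diff.

```python
# Hypothetical stand-in for the module-name lookup; NOT DeepSpeed's actual code.
# Only the table entries below come from the diff.
FUSED_TYPE_DICT = {
    "MptBlock": "glmtype",
    "BaichuanLayer": "glmtype",
    "DecoderLayer": "glmtype",
    "FalconDecoderLayer": "bloomtype",
    "GPTBigCodeBlock": "bigcodetype",
}


def resolve_fused_type(module_str, table=FUSED_TYPE_DICT):
    """Return the type for the longest key contained in module_str, or None.

    Assumption: keys are matched as substrings and the longest match wins.
    """
    matches = [key for key in table if key in module_str]
    if not matches:
        return None
    return table[max(matches, key=len)]


# The table as it stood before this commit (no explicit Falcon entry).
before = {k: v for k, v in FUSED_TYPE_DICT.items() if k != "FalconDecoderLayer"}

print(resolve_fused_type("FalconDecoderLayer(...)", before))  # glmtype
print(resolve_fused_type("FalconDecoderLayer(...)"))          # bloomtype
```

Under this assumed rule, the more specific key takes precedence regardless of its position in the dictionary, so the generic "DecoderLayer" entry keeps working for other models.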