Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
fix falcon-40b accuracy issue (microsoft#4895)
This [PR](microsoft#4721) added the "DecoderLayer":glmtype. It will cause the Falcon model to choose "glmtype" fused_qkv_type. Falcon model (including Falcondecoderlayer) needs to choose 'bloomtype' explicitly. Co-authored-by: Michael Wyatt <[email protected]>
- Loading branch information