-
Notifications
You must be signed in to change notification settings - Fork 2.4k
Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
[CPU][ARM] Fixed cvt_copy fast path for mha_single_token_kernel (#28265)
### Details: - This PR fixes incorrect cvt_copy rountine behavior inside mha_single_token kenrel on ARM platforms. In case __ARM_FEATURE_FP16_VECTOR_ARITHMETIC is defined on the system and fp32 inference scalar code path is chosen. - Additionally cvt_copy impl is refactored via template specialization for better readability - Follow-up after #28182
- Loading branch information
1 parent
6c610e9
commit b27eefb
Showing
4 changed files
with
70 additions
and
29 deletions.
There are no files selected for viewing
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters