patch avx512 dispatch for 8bit_direct for PR 3853 #3871

mengdilin · 2024-09-19T00:57:37Z

8bit_direct no longer has a huge regression https://www.internalfb.com/phabricator/paste/view/P1596996696 because we will branch to avx2 if d (size of the input vectors) is not a multiple of 32 but a multiple of 16.

Differential Revision: D62990470

…oad the correct commit data. Differential Revision: D62989543

Summary: Context in https://www.internalfb.com/diff/D62989543?dst_version_fbid=830141322266715&transaction_fbid=927659569187374 8bit_direct no longer has a huge regression https://www.internalfb.com/phabricator/paste/view/P1596996696 because we will branch to avx2 if `d` (size of the input vectors) is not a multiple of 32 but a multiple of 16. Differential Revision: D62990470

facebook-github-bot · 2024-09-19T00:57:54Z

This pull request was exported from Phabricator. Differential Revision: D62990470

mengdilin and others added 2 commits September 18, 2024 16:22

Generated from a GitHub Pull Request. Run 'jf sync' on this diff to l…

d06b600

…oad the correct commit data. Differential Revision: D62989543

facebook-github-bot added the CLA Signed label Sep 19, 2024

facebook-github-bot added the fb-exported label Sep 19, 2024

asadoughi added the Performance label Sep 20, 2024

asadoughi added the ignore label Oct 22, 2024

Provide feedback