-
Notifications
You must be signed in to change notification settings - Fork 1.5k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Improve count
vectorization: replace popcnt
implementation with vector counting
#4614
Merged
StephanTLavavej
merged 27 commits into
microsoft:main
from
AlexGuteniev:count_is_not_find
Apr 26, 2024
Merged
Improve count
vectorization: replace popcnt
implementation with vector counting
#4614
StephanTLavavej
merged 27 commits into
microsoft:main
from
AlexGuteniev:count_is_not_find
Apr 26, 2024
Commits on Apr 21, 2024
-
Configuration menu - View commit details
-
Copy full SHA for b1aacab - Browse repository at this point
Copy the full SHA b1aacabView commit details -
Configuration menu - View commit details
-
Copy full SHA for 6f134e8 - Browse repository at this point
Copy the full SHA 6f134e8View commit details -
Configuration menu - View commit details
-
Copy full SHA for 35d61ac - Browse repository at this point
Copy the full SHA 35d61acView commit details
Commits on Apr 22, 2024
-
Configuration menu - View commit details
-
Copy full SHA for ce8d8a5 - Browse repository at this point
Copy the full SHA ce8d8a5View commit details -
Configuration menu - View commit details
-
Copy full SHA for af456db - Browse repository at this point
Copy the full SHA af456dbView commit details -
Configuration menu - View commit details
-
Copy full SHA for c232f62 - Browse repository at this point
Copy the full SHA c232f62View commit details -
Configuration menu - View commit details
-
Copy full SHA for 498408c - Browse repository at this point
Copy the full SHA 498408cView commit details -
Configuration menu - View commit details
-
Copy full SHA for 48efb24 - Browse repository at this point
Copy the full SHA 48efb24View commit details -
Configuration menu - View commit details
-
Copy full SHA for 403ef98 - Browse repository at this point
Copy the full SHA 403ef98View commit details -
Configuration menu - View commit details
-
Copy full SHA for 983e93a - Browse repository at this point
Copy the full SHA 983e93aView commit details -
Configuration menu - View commit details
-
Copy full SHA for 64b6fe6 - Browse repository at this point
Copy the full SHA 64b6fe6View commit details -
Configuration menu - View commit details
-
Copy full SHA for 85263d8 - Browse repository at this point
Copy the full SHA 85263d8View commit details -
Configuration menu - View commit details
-
Copy full SHA for 0a92efc - Browse repository at this point
Copy the full SHA 0a92efcView commit details -
Configuration menu - View commit details
-
Copy full SHA for d816df0 - Browse repository at this point
Copy the full SHA d816df0View commit details -
Configuration menu - View commit details
-
Copy full SHA for dedc0cc - Browse repository at this point
Copy the full SHA dedc0ccView commit details
Commits on Apr 24, 2024
-
Configuration menu - View commit details
-
Copy full SHA for 015d4f7 - Browse repository at this point
Copy the full SHA 015d4f7View commit details
Commits on Apr 25, 2024
-
Fix
_Count_traits_4::_Max_count
.0x1FFF'FFFF would be strict for the SSE4.2 codepath, but 0xFFF'FFFF is strict for the AVX2 codepath.
Configuration menu - View commit details
-
Copy full SHA for 7e39a04 - Browse repository at this point
Copy the full SHA 7e39a04View commit details -
Configuration menu - View commit details
-
Copy full SHA for 8d4aab5 - Browse repository at this point
Copy the full SHA 8d4aab5View commit details -
For clarity, scope
__m128i _Count_vector
to each iteration of the S……SE4.2 loop. It never carries information across iterations.
Configuration menu - View commit details
-
Copy full SHA for 50d610a - Browse repository at this point
Copy the full SHA 50d610aView commit details -
For the AVX2 loop, scope
__m256i _Count_vector
separately for the m……ain loop and tail. This allows us to avoid subtracting 1 from `_Max_count`, making it more similar to the SSE4.2 loop.
Configuration menu - View commit details
-
Copy full SHA for c06d4ff - Browse repository at this point
Copy the full SHA c06d4ffView commit details -
Configuration menu - View commit details
-
Copy full SHA for 3f95815 - Browse repository at this point
Copy the full SHA 3f95815View commit details -
Configuration menu - View commit details
-
Copy full SHA for 22b561b - Browse repository at this point
Copy the full SHA 22b561bView commit details -
Configuration menu - View commit details
-
Copy full SHA for e587698 - Browse repository at this point
Copy the full SHA e587698View commit details -
Configuration menu - View commit details
-
Copy full SHA for 87749d2 - Browse repository at this point
Copy the full SHA 87749d2View commit details -
Configuration menu - View commit details
-
Copy full SHA for 2e1d6c7 - Browse repository at this point
Copy the full SHA 2e1d6c7View commit details -
Configuration menu - View commit details
-
Copy full SHA for fc6dcbd - Browse repository at this point
Copy the full SHA fc6dcbdView commit details -
Configuration menu - View commit details
-
Copy full SHA for c5dd5c2 - Browse repository at this point
Copy the full SHA c5dd5c2View commit details
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.