-
Notifications
You must be signed in to change notification settings - Fork 784
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Performance improvements for take #279
Comments
Some first experimenting didn't reveal a lot of options. |
@Dandandan I once checked the assembly output of what the Here's some example assembly output: https://godbolt.org/z/1nsq4reT7 That could explain why you didn't see perf changes. |
Thanks! That makes sense. It seems further speedups might come only from somehow removing bound checks and SIMD gather. |
SIMD gather is not being used atm right? I remember looking for this operation in |
Yeah I also did some googling - seems not supported in |
These are good opportunities to use |
Is your feature request related to a problem or challenge? Please describe what you are trying to do.
Take is a critical function that is used in quite some code and is hot up in profiling. It seems like we should be able to speed it up.
Two improvements we can do:
Describe the solution you'd like
Check whether we can eliminate / avoid bound checks.
Add SIMD versions of take
Describe alternatives you've considered
Additional context
The text was updated successfully, but these errors were encountered: