-
Notifications
You must be signed in to change notification settings - Fork 4
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
perf: exploration of better matmul algorithms #69
base: main
Are you sure you want to change the base?
Conversation
Check out this pull request on See visual diffs & provide feedback on Jupyter Notebooks. Powered by ReviewNB |
for more information, see https://pre-commit.ci
Codecov ReportAll modified and coverable lines are covered by tests ✅
Additional details and impacted files@@ Coverage Diff @@
## main #69 +/- ##
============================================
- Coverage 100.00% 65.07% -34.93%
============================================
Files 8 8
Lines 567 567
Branches 88 88
============================================
- Hits 567 369 -198
- Misses 0 193 +193
- Partials 0 5 +5
Flags with carried forward coverage won't be shown. Click here to find out more. ☔ View full report in Codecov by Sentry. |
…cpu into matrix-multiply-profiling
for more information, see https://pre-commit.ci
for more information, see https://pre-commit.ci
for more information, see https://pre-commit.ci
for more information, see https://pre-commit.ci
If you're going through all the efforts of writing a cublas implementation, might investigated compiled extensions like cython or a rust extension. Both have interop with numpy which is pretty straightforward |
This adds a notebook that explores different ways to do the
V = Z.dot(Z.T.conj())
calculation.Ideas that I haven't yet explored: