-
Notifications
You must be signed in to change notification settings - Fork 126
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Consider oneDNN instead of MKL for SGEMM #706
Comments
Cool, should be easy to check? |
Is the expectation of better performance for any arch or AVX512 specific? |
I've only bothered to measure AVX512 but we should check. Paging @sidkashyap. |
OneDNN v1.7 improves the performance for older architectures too, including SSE4.1 for int8 We provided the Matrix Multiplications ranks from Marian� Inference for oneDNN to be optimized, the latest version includes those optimizations. |
I have a branch with oneDNN. (You also need to disable cblas_sgemm_batched, which i forgot to do) https://github.com/XapaJIaMnu/marian-dev/tree/oneDNN We need banchmarks to show that it's not slow. Unfortunately, there isn't much incentive to switch to oneDNN completely, as we still need MKL (or some sort of BLAS), because of FAISS requiring things like |
Hey @XapaJIaMnu, do we have a priority list of functions in MKL that need to be oneDNN? I will work with the oneDNN team to sort that out if possible. |
Basically, FAISS dependencies. Cheers, Nick |
Thank you, this will help in getting the request quantified. Will update on the progress soon. |
Had an internal discussion with @vpirogov from the oneDNN team, unfortunately the support for FAISS MKL dependencies cannot be addressed in oneDNN as it is outside the Deep Learning remit that the library focuses on. |
Hi, we need the FAISS support internally, but we can make it depend on finding MKL only? |
It's used in k-NN MT (https://arxiv.org/pdf/2010.00710.pdf) as well. I see several of those hash, search based methods in DL these days. |
@ykim362 very good point! All DNN with retrieval methods would rely on it. |
MKL is blocking Wikipedia from deploying Marian because it is closed source. |
https://github.com/oneapi-src/oneDNN/ aka MKLDNN aka DNNL now has better performance for MT-size matrices: apache/mxnet#17980 . And it's open source. The same teams write the GEMM for MKL and oneDNN.
Would be worth benchmarking.
The text was updated successfully, but these errors were encountered: