Consider oneDNN instead of MKL for SGEMM #706

kpu · 2020-08-31T23:28:00Z

https://github.com/oneapi-src/oneDNN/ aka MKLDNN aka DNNL now has better performance for MT-size matrices: apache/mxnet#17980 . And it's open source. The same teams write the GEMM for MKL and oneDNN.

Would be worth benchmarking.

emjotde · 2020-08-31T23:30:43Z

Cool, should be easy to check?

emjotde · 2020-08-31T23:50:22Z

Is the expectation of better performance for any arch or AVX512 specific?

kpu · 2020-08-31T23:51:55Z

I've only bothered to measure AVX512 but we should check. Paging @sidkashyap.

XapaJIaMnu · 2020-10-09T15:02:48Z

https://github.com/XapaJIaMnu/marian-dev/tree/oneDNN

sidkashyap-at-Intel · 2020-11-18T13:37:17Z

OneDNN v1.7 improves the performance for older architectures too, including SSE4.1 for int8
https://github.com/oneapi-src/oneDNN/releases/tag/v1.7

We provided the Matrix Multiplications ranks from Marian� Inference for oneDNN to be optimized, the latest version includes those optimizations.

XapaJIaMnu · 2020-11-18T18:50:51Z

I have a branch with oneDNN. (You also need to disable cblas_sgemm_batched, which i forgot to do) https://github.com/XapaJIaMnu/marian-dev/tree/oneDNN

We need banchmarks to show that it's not slow. Unfortunately, there isn't much incentive to switch to oneDNN completely, as we still need MKL (or some sort of BLAS), because of FAISS requiring things like undefined reference to sorgqr_` @sidkashyap-at-Intel can we get a word to intel people to include some of those basic BLAS routines inside oneDNN?

sidkashyap-at-Intel · 2020-11-27T09:17:28Z

Hey @XapaJIaMnu, do we have a priority list of functions in MKL that need to be oneDNN? I will work with the oneDNN team to sort that out if possible.

XapaJIaMnu · 2020-11-27T18:09:12Z

@sidkashyap-at-Intel

../libmarian.a(VectorTransform.cpp.o): In function `(anonymous namespace)::eig(unsigned long, double*, double*, int)':
/home/nbogoych/marian-dev-tst/src/3rd_party/faiss/VectorTransform.cpp:428: undefined reference to `dsyev_'
/home/nbogoych/marian-dev-tst/src/3rd_party/faiss/VectorTransform.cpp:433: undefined reference to `dsyev_'
../libmarian.a(VectorTransform.cpp.o): In function `faiss::LinearTransform::transform_transpose(long, float const*, float*) const':
/home/nbogoych/marian-dev-tst/src/3rd_party/faiss/VectorTransform.cpp:291: undefined reference to `sgemm_'
../libmarian.a(VectorTransform.cpp.o): In function `matrix_qr(int, int, float*)':
/home/nbogoych/marian-dev-tst/src/3rd_party/faiss/VectorTransform.cpp:98: undefined reference to `sgeqrf_'
/home/nbogoych/marian-dev-tst/src/3rd_party/faiss/VectorTransform.cpp:103: undefined reference to `sgeqrf_'
/home/nbogoych/marian-dev-tst/src/3rd_party/faiss/VectorTransform.cpp:106: undefined reference to `sorgqr_'
../libmarian.a(VectorTransform.cpp.o): In function `faiss::LinearTransform::apply_noalloc(long, float const*, float*) const':
/home/nbogoych/marian-dev-tst/src/3rd_party/faiss/VectorTransform.cpp:266: undefined reference to `sgemm_'
../libmarian.a(VectorTransform.cpp.o): In function `faiss::LinearTransform::set_is_orthonormal()':
/home/nbogoych/marian-dev-tst/src/3rd_party/faiss/VectorTransform.cpp:317: undefined reference to `sgemm_'
../libmarian.a(VectorTransform.cpp.o): In function `faiss::PCAMatrix::prepare_Ab()':
/home/nbogoych/marian-dev-tst/src/3rd_party/faiss/VectorTransform.cpp:710: undefined reference to `sgemm_'
../libmarian.a(VectorTransform.cpp.o): In function `faiss::PCAMatrix::train(long, float const*)':
/home/nbogoych/marian-dev-tst/src/3rd_party/faiss/VectorTransform.cpp:559: undefined reference to `ssyrk_'
/home/nbogoych/marian-dev-tst/src/3rd_party/faiss/VectorTransform.cpp:597: undefined reference to `sgemm_'
/home/nbogoych/marian-dev-tst/src/3rd_party/faiss/VectorTransform.cpp:518: undefined reference to `ssyrk_'
collect2: error: ld returned 1 exit status

Basically, FAISS dependencies.

Cheers,

Nick

sidkashyap-at-Intel · 2020-11-27T18:12:01Z

Thank you, this will help in getting the request quantified. Will update on the progress soon.

sidkashyap-at-Intel · 2021-01-11T23:55:20Z

Had an internal discussion with @vpirogov from the oneDNN team, unfortunately the support for FAISS MKL dependencies cannot be addressed in oneDNN as it is outside the Deep Learning remit that the library focuses on.

emjotde · 2021-01-12T02:43:09Z

Hi, we need the FAISS support internally, but we can make it depend on finding MKL only?

ykim362 · 2021-01-12T04:00:16Z

Had an internal discussion with @vpirogov from the oneDNN team, unfortunately the support for FAISS MKL dependencies cannot be addressed in oneDNN as it is outside the Deep Learning remit that the library focuses on.

It's used in k-NN MT (https://arxiv.org/pdf/2010.00710.pdf) as well. I see several of those hash, search based methods in DL these days.

emjotde · 2021-01-12T04:22:59Z

@ykim362 very good point! All DNN with retrieval methods would rely on it.

vpirogov · 2021-01-14T00:23:27Z

@ykim362, @emjotde,

oneDNN is focused on deep learning algorithms. oneAPI has specialized data analytics library, oneDAL, that supports kNN and other machine learning algorithms.

kpu · 2022-02-16T14:26:28Z

MKL is blocking Wikipedia from deploying Marian because it is closed source.

kpu added enhancement question labels Aug 31, 2020

kpu mentioned this issue Aug 31, 2020

Fetch Content for MKL? #707

Open

graemenail mentioned this issue Jun 3, 2022

Add oneDNN #937

Closed

4 tasks

graemenail linked a pull request Sep 14, 2022 that will close this issue

oneDNN + MKL #967

Open

4 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Consider oneDNN instead of MKL for SGEMM #706

Consider oneDNN instead of MKL for SGEMM #706

kpu commented Aug 31, 2020

emjotde commented Aug 31, 2020

emjotde commented Aug 31, 2020 •

edited

Loading

kpu commented Aug 31, 2020

XapaJIaMnu commented Oct 9, 2020

sidkashyap-at-Intel commented Nov 18, 2020 •

edited

Loading

XapaJIaMnu commented Nov 18, 2020 •

edited

Loading

sidkashyap-at-Intel commented Nov 27, 2020

XapaJIaMnu commented Nov 27, 2020

sidkashyap-at-Intel commented Nov 27, 2020

sidkashyap-at-Intel commented Jan 11, 2021

emjotde commented Jan 12, 2021 •

edited

Loading

ykim362 commented Jan 12, 2021

emjotde commented Jan 12, 2021

vpirogov commented Jan 14, 2021

kpu commented Feb 16, 2022

Consider oneDNN instead of MKL for SGEMM #706

Consider oneDNN instead of MKL for SGEMM #706

Comments

kpu commented Aug 31, 2020

emjotde commented Aug 31, 2020

emjotde commented Aug 31, 2020 • edited Loading

kpu commented Aug 31, 2020

XapaJIaMnu commented Oct 9, 2020

sidkashyap-at-Intel commented Nov 18, 2020 • edited Loading

XapaJIaMnu commented Nov 18, 2020 • edited Loading

sidkashyap-at-Intel commented Nov 27, 2020

XapaJIaMnu commented Nov 27, 2020

sidkashyap-at-Intel commented Nov 27, 2020

sidkashyap-at-Intel commented Jan 11, 2021

emjotde commented Jan 12, 2021 • edited Loading

ykim362 commented Jan 12, 2021

emjotde commented Jan 12, 2021

vpirogov commented Jan 14, 2021

kpu commented Feb 16, 2022

emjotde commented Aug 31, 2020 •

edited

Loading

sidkashyap-at-Intel commented Nov 18, 2020 •

edited

Loading

XapaJIaMnu commented Nov 18, 2020 •

edited

Loading

emjotde commented Jan 12, 2021 •

edited

Loading