v0.11
Performance optimizations
- Improved convolution performance on future Intel(R) Xeon Phi(TM) processors with AVX512_4FMAPS and AVX512_4VNNIW instruction groups support
- Improved convolution performance on Intel(R) Xeon processors with Intel(R) AVX512 instruction set support
- Improved performance of GEMM-based convolutions for small minibatches
- Improved performance of the Winograd convolution algorithm on Intel(R) Xeon Phi(TM) processors
New functionality
- Added backpropagation support for dilated convolution.
- Extended the eltwise primitive with support for square, abs, square root, linear, bounded ReLU, soft ReLU, and logistic.
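For reference, the element-wise operations listed above can be sketched as scalar functions. This is a minimal illustration using the common textbook definitions, not the library's implementation: the function names and the `alpha`/`beta` parameter names are assumptions made for this example, and the library's exact parameterization may differ.

```cpp
#include <algorithm>
#include <cassert>
#include <cmath>

// Textbook scalar definitions, for illustration only.
// Function names and the alpha/beta parameters are hypothetical,
// not taken from the library's API.
inline float eltwise_square(float x) { return x * x; }
inline float eltwise_abs(float x) { return std::fabs(x); }
inline float eltwise_sqrt(float x) { return std::sqrt(x); }  // meaningful for x >= 0

// linear: alpha * x + beta
inline float eltwise_linear(float x, float alpha, float beta) {
    return alpha * x + beta;
}

// bounded ReLU: clamp x to the range [0, alpha]
inline float eltwise_bounded_relu(float x, float alpha) {
    return std::min(std::max(x, 0.0f), alpha);
}

// soft ReLU: log(1 + exp(x)); this naive form overflows for large x
inline float eltwise_soft_relu(float x) { return std::log1p(std::exp(x)); }

// logistic: 1 / (1 + exp(-x))
inline float eltwise_logistic(float x) { return 1.0f / (1.0f + std::exp(-x)); }
```

Under these definitions, bounded ReLU with `alpha = 6` maps 10 to 6 and -1 to 0, and logistic maps 0 to 0.5.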
Usability improvements
- Added macOS* support.
Breaking changes to the API
- All real-valued op descriptor parameters now have the float data type (previously double). This change breaks C API backward compatibility for the sum primitive. Please refer to 0bbb22e for details. The C++ API maintains backward compatibility.
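As a rough illustration of what a double-to-float parameter change means for calling code, the sketch below narrows double-precision scales to float before passing them to a C-style function. Both `weighted_sum` and `to_float_scales` are hypothetical helpers written for this example; they are not part of the library's API, and the actual signature change is described in the referenced commit.

```cpp
#include <cassert>
#include <cstddef>
#include <vector>

// Hypothetical C-style entry point, for illustration only:
// it accepts single-precision scales, as a float-based API would.
inline float weighted_sum(const float *scales, const float *values, std::size_t n) {
    float acc = 0.0f;
    for (std::size_t i = 0; i < n; ++i) acc += scales[i] * values[i];
    return acc;
}

// Hypothetical adapter: narrow double-precision scales kept by older
// caller code to float. Precision beyond float is lost in the conversion.
inline std::vector<float> to_float_scales(const std::vector<double> &scales) {
    return std::vector<float>(scales.begin(), scales.end());
}
```

A caller that still stores scales as `std::vector<double>` could convert once with `to_float_scales` and pass `.data()` of the result to the float-based function.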
Thanks to the contributors
This release contains contributions from many Intel(R) Performance Libraries developers as well as Yu Yang @reyoung, Vladimir Mironov @vamironov, Nishant Patel @nbpatel, Leona Cook @indie, Jayaram Bobba @jbobba, and Elena Gvozdeva. We would also like to thank everyone who asked questions and reported issues.
* Other names and brands may be claimed as the property of others.