Releases · oneapi-src/oneDNN
v0.21.1
This is a patch release containing the following changes to Intel MKL-DNN v0.21:
v0.21
Performance optimizations
- Improved int8 and fp32 GEMM and inner product performance.
- Improved reorder performance for certain shapes.
- Improved RNN, LSTM, GRU and LBR-GRU training performance.
New functionality
- Added GELU activation support (a reference formula sketch follows below).
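For reference, GELU is defined as GELU(x) = 0.5 · x · (1 + erf(x / √2)). The snippet below is a minimal standalone C++ sketch of that formula, useful for sanity-checking results; it is an illustration only and does not use the Intel MKL-DNN API.

```cpp
#include <cmath>
#include <cstdio>

// Reference GELU using the erf-based definition:
// GELU(x) = 0.5 * x * (1 + erf(x / sqrt(2)))
static float gelu_ref(float x) {
    return 0.5f * x * (1.0f + std::erf(x / std::sqrt(2.0f)));
}

int main() {
    const float inputs[] = {-2.0f, -0.5f, 0.0f, 0.5f, 2.0f};
    for (float x : inputs)
        std::printf("gelu(%+.2f) = %+.6f\n", x, gelu_ref(x));
    return 0;
}
```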
Thanks to the contributors
This release contains contributions from many Intel Performance Libraries developers. We would also like to thank everyone who asked questions and reported issues.
v1.1-rc
This is a release candidate for DNNL v1.1. Please provide feedback and report bugs in GitHub issues.
v0.20.5
This is a patch release containing the following changes to Intel MKL-DNN v0.20.4:
v0.20.4
v0.21-rc
This is a release candidate for Intel MKL-DNN v0.21. Please provide feedback and report bugs in GitHub issues.
v0.20.3
v1.0.2
This is a patch release containing the following changes to Intel MKL-DNN v1.0.1:
- Fixed an issue with bfloat16 instruction detection in Xbyak (0f4ba11)
- Fixed the buffer size in packed GEMM (9764940)
- Fixed an offset calculation issue in the fp32 and bfloat16 weight update depthwise convolution kernels (6b9d412, 061499d)
- Added a check that the size of the generated kernel does not exceed the maximum allowed bound in the fp32 forward and backward kernels (67e8cd2)
- Various fixes in the RNN primitive:
  - Proper handling of packed GEMM in extended GEMM (4eb9f56)
  - Forced no-copy GEMM only on Intel AVX+ systems (2fbc8ba)
  - Avoided unaligned pointer usage in VEX instructions in the GRU cell (a147c08)
  - Fixed a wrong dimension when creating the GEMM primitive descriptor in the reference RNN implementation for GPU (eb3c866)
  - Fixed the Tanh backward calculation in the GPU RNN reference implementation (f6e4b97)
- Fixed pack GEMM dispatching for int8 (16b46c7)
- Addressed bugs in tests for RNNs (cf83e83, f7c2de2, 960f3f3)
v0.20.2
This is a patch release containing the following changes to Intel MKL-DNN v0.20.1:
- Fixed an issue with bfloat16 instruction detection in Xbyak (b59bf2e)
- Fixed an offset calculation issue in the fp32 and bfloat16 weight update depthwise convolution kernels (ddc54e5, 0982b25)
- Added a check that the size of the generated kernel does not exceed the maximum allowed bound in the fp32 forward and backward kernels (24abe20)
- Various fixes in RNN primitive: