[OpenBLAS_jll] Update to new build with BFloat16 kernels #53059

giordano · 2024-01-26T00:42:24Z

This PR also

drops a patch (deps/patches/neoverse-generic-kernels.patch) not needed anymore for an old bug fixed upstream in OpenBLAS. This results in ~5x speedup in the computation of BLAS.nrm2 (and hence LinearAlgebra.norm for vectors longer than LinearAlgebra.NRM2_CUTOFF (== 32) elements) when the neoversen1 kernels are used, e.g. by default on all Apple Silicon CPUs
adds a regression test for the above bug
updates other patches when building openblas from source

Corresponding PR in Yggdrasil: JuliaPackaging/Yggdrasil#7202. CC: @imciner2.

imciner2 · 2024-01-26T00:51:24Z

We also have a new SVE patch in Yggdrasil: https://github.com/JuliaPackaging/Yggdrasil/blob/master/O/OpenBLAS/OpenBLAS%400.3.26/bundled/patches/90-darwin-sve.patch that I don't see included here. It is needed for the Apple aarch64 BFloat enabled builds.

The new build also includes patches to * improve multithreading * fix builds of BFloat16 kernels on AVX512BF16

…alues This would have caught an old upstream bug in OpenBLAS, which was later fixed in v0.3.14.

giordano · 2024-01-26T01:13:15Z

Oops, added, thanks!

ctkelley · 2024-01-26T10:53:17Z

Once this is in the nightly, how do I make BLAS calls? Is there a bgemm out there. I'm pretty sure that @timothyaldendavis cares too

ViralBShah · 2024-01-26T14:36:56Z

Yeah you can just ccall those functions, but you will need to use the types from BFloat16s.jl etc.

giordano added building Build system, or building Julia or its dependencies linear algebra Linear algebra external dependencies Involves LLVM, OpenBLAS, or other linked libraries JLLs labels Jan 26, 2024

giordano requested a review from ViralBShah January 26, 2024 00:42

giordano added 2 commits January 26, 2024 01:12

[OpenBLAS_jll] Update to new build with BFloat16 kernels

e4793b0

The new build also includes patches to * improve multithreading * fix builds of BFloat16 kernels on AVX512BF16

[LinearAlgebra] Add regression test for BLAS.nrm2 with non-finite v…

e758c30

…alues This would have caught an old upstream bug in OpenBLAS, which was later fixed in v0.3.14.

giordano force-pushed the mg/openblas branch from a12a9aa to e758c30 Compare January 26, 2024 01:12

ViralBShah approved these changes Jan 26, 2024

View reviewed changes

giordano merged commit 5d4d6ab into JuliaLang:master Jan 26, 2024
7 checks passed

giordano deleted the mg/openblas branch January 26, 2024 10:18

imciner2 mentioned this pull request Jan 26, 2024

apply OpenBLAS_jll v0.3.23+4 patch #53074

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[OpenBLAS_jll] Update to new build with BFloat16 kernels #53059

[OpenBLAS_jll] Update to new build with BFloat16 kernels #53059

giordano commented Jan 26, 2024

imciner2 commented Jan 26, 2024

giordano commented Jan 26, 2024

ctkelley commented Jan 26, 2024 •

edited by ViralBShah

Loading

ViralBShah commented Jan 26, 2024

[OpenBLAS_jll] Update to new build with BFloat16 kernels #53059

[OpenBLAS_jll] Update to new build with BFloat16 kernels #53059

Conversation

giordano commented Jan 26, 2024

imciner2 commented Jan 26, 2024

giordano commented Jan 26, 2024

ctkelley commented Jan 26, 2024 • edited by ViralBShah Loading

ViralBShah commented Jan 26, 2024

ctkelley commented Jan 26, 2024 •

edited by ViralBShah

Loading