Update dependency libopenblas to v0.3.33 by renovate[bot] · Pull Request #12 · MialLewis/mantid

renovate · 2026-04-10T16:10:24Z

This PR contains the following updates:

| Package | Update | Change |
|---|---|---|
| libopenblas (source) | patch | `==0.3.27` → `==0.3.33` |
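
As a rough illustration of why this change is labelled a patch update, the sketch below compares the two exact pins component by component. The helper names `parse_pin` and `classify_update` are hypothetical and are not part of Renovate; the sketch assumes plain `major.minor.patch` version strings.

```python
def parse_pin(spec: str) -> tuple:
    """Turn an exact pin such as '==0.3.27' into a tuple of integers."""
    return tuple(int(part) for part in spec.lstrip("=").split("."))


def classify_update(old: str, new: str) -> str:
    """Name the first version component that differs (hypothetical helper)."""
    labels = ("major", "minor", "patch")
    for label, before, after in zip(labels, parse_pin(old), parse_pin(new)):
        if before != after:
            return label
    return "none"


# Only the third component changes between the two pins in this PR.
print(classify_update("==0.3.27", "==0.3.33"))
```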

Release Notes

OpenMathLib/OpenBLAS (libopenblas)

`v0.3.33`: OpenBLAS 0.3.33 version

general:

fixed an incorrect cast in the SBGEMM test case that could lead to spurious test failures
fixed an invalid memory access in the converted C version of the CBLAS tests
made the BIGNUMA setting automatic when the number of cores exceeds 256
Imported recent updates from Reference-LAPACK to realign with its upcoming 3.13.0 release:
- Implement ?LARF1F and ?ORM2R (Reference-LAPACK PRs 1019,1020,1196,1257)
- Change loop order in ?GETC2 to improve performance (Reference-LAPACK PR 1023)
- Change WORK array dimension in ?GELQS/?GEQRS (Reference-LAPACK PR 1094)
- Add NaN checks for input matrix A in ?GEEV (Reference-LAPACK PR 1136)
- Fix support for jobu/v in LAPACKE_?GESVDQ_WORK (Reference-LAPACK PRs 1146,1221)
- Fix display of version number in LAPACK testsuite (Reference-LAPACK PR 1149)
- Fix DGGES test seed to avoid bad matrix cases (Reference-LAPACK PR 1187)
- Fix truncation of large WORK array sizes in ZHE (Reference-LAPACK PR 1195)
- Fix overwriting of LDSWORK parameter in ?TRSYL3 (Reference-LAPACK PR 1206)
- Fix overwriting of error states in some EIG tests (Reference-LAPACK PR 1207)
- Remove unused parameter in DORBDB3/ZUNBDB3 (Reference-LAPACK PR 1209)
- Re-enable testing of ?BB and ?GG driver functions (Reference-LAPACK PR 1211)
- Fix workspace size calculation in ?TGSEN (Reference-LAPACK PR 774)
- Fix typos in the EIG DMD tests and initialize the cutoff variable (PR 1212,1228)
- Optimized looping in ?LACPY/?LASCL/?LANTR with fat matrix and UPLO=L (PR 1251)

arm64:

worked around a serious miscompilation of the DDOT kernel by GCC15 (affecting
most non-SVE targets, and SVE targets in the case of non-unit array stride)
fixed an accuracy issue in the GEMV kernel for Neoverse V1 and other SVE targets
fixed broken STRMM and SSYMM in DYNAMIC_ARCH builds when running on non-SME hardware
added an optimized SHGEMM kernel for Neoverse N2
fixed DYNAMIC_ARCH builds under Windows on Arm
Added autodetection of Cortex A75/A76 in DYNAMIC_ARCH builds
Added autodetection of Neoverse V3, currently supported through V2 kernels
Re-added support for the "VORTEX" target in DYNAMIC_ARCH builds with DYNAMIC_LIST
Fixed CMake-based builds that use the "Ninja" generator
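
For readers unfamiliar with the term "non-unit array stride" used in the DDOT item above, here is a minimal pure-Python sketch of a strided dot product in the style of the reference BLAS. It is only an illustration of the calling convention, not the OpenBLAS kernel, and it uses 0-based indexing.

```python
def ddot(n, x, incx, y, incy):
    """Reference-style dot product over n elements with strides incx and incy."""
    # For a negative stride the walk starts at the far end, as in reference BLAS.
    ix = 0 if incx >= 0 else (1 - n) * incx
    iy = 0 if incy >= 0 else (1 - n) * incy
    total = 0.0
    for _ in range(n):
        total += x[ix] * y[iy]
        ix += incx
        iy += incy
    return total


# Stride 2 on x picks every other element: 1, 3 and 5.
print(ddot(3, [1.0, 2.0, 3.0, 4.0, 5.0, 6.0], 2, [1.0, 1.0, 1.0], 1))
```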

loongarch64:

fixed a build failure due to missing support for the new half-precision float type
fixed a long-standing bug in asserting 64bit capability in the c_check helper script

x86_64:

added a workaround for miscompilation of the AVX512 GEMM kernels by LLVM on Windows
fixed a build failure in the LAED3 code when compiling with MinGW on Windows
fixed CMake-based compilation with the NVIDIA HPC compiler
Fixed CMake-based builds that use the "Ninja" generator

wasm:

added optimized kernels for STRSM and DTRSM

md5sums:
96c5cd9 OpenBLAS-0.3.33.tar.gz
81637d0 OpenBLAS-0.3.33.zip
153b444 OpenBLAS-0.3.33-x86.zip
93022c3 OpenBLAS-0.3.33-x64.zip
e30aab9 OpenBLAS-0.3.33-x64-64.zip
c7b2134 OpenBLAS-0.3.33-woa64-static.zip
90cfd16 OpenBLAS-0.3.33-woa64-dll.zip
4b052df OpenBLAS-0.3.33-woa64-64-static.zip
010b1ac OpenBLAS-0.3.33-woa64-64-dll.zip
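
The checksums above appear here in abbreviated form. A minimal sketch for checking a downloaded archive against such a value follows, assuming the listed string is a prefix of the full MD5 hex digest (that is an assumption, not something the release notes state). The helper names are hypothetical. MD5 is adequate for spotting a corrupted download but not for detecting deliberate tampering.

```python
import hashlib
from pathlib import Path


def md5_hex(path: Path, chunk_size: int = 1 << 20) -> str:
    """Compute the MD5 hex digest of a file without loading it all at once."""
    digest = hashlib.md5()
    with path.open("rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_listed(full_digest: str, listed: str) -> bool:
    """True if the (possibly abbreviated) listed value is a prefix of the digest."""
    return full_digest.lower().startswith(listed.lower())


# Usage, with a path that is assumed to exist locally:
# ok = matches_listed(md5_hex(Path("OpenBLAS-0.3.33.tar.gz")), "96c5cd9")
```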

`v0.3.32`: OpenBLAS 0.3.32 version

general:

Moved the preliminary support for a Web Assembly target to its own WASM
architecture and WASM128_GENERIC target
Fixed a potential performance difference between dedicated compilation for
a target and its representation in DYNAMIC_ARCH builds by making additional
cpu-specific parameters available to the DYNAMIC_ARCH configuration
Fixed the reimplementation of LAPACK ?GESV to conform to the reference (i.e.
compute the LU factorization even when NRHS is zero)
Improved the error message that is displayed when the compile-time allocation
of memory buffers is exceeded
Fixed a problem with non-serialized accesses to parallelized SYRK by concurrent
callers
Fixed an ABI mismatch in the internal version of CDOT/ZDOT used by the C fallback
versions of the LAPACK source
Improved the f_check script for detecting the Fortran compiler to handle embedded
dashes in path names
Fixed several memory access issues in the utests that were detected by Address
Sanitizer
Fixed Makefile errors in cases where only a subset of precision types was selected
Fixed missing function errors in Makefile builds without LAPACK or without threads
Fixed a syntax error in the benchmarks Makefile
Fixed compiler warnings in the CBLAS testsuite
Fixed the OpenMP compiler option used with the Intel Ifx compiler
Updated the README sections on supported cpus and operating systems, and added
notes pertaining to JAVA
Updated the documentation page for supported BLAS-like extensions
included fixes from the Reference-LAPACK project:
- Improved step length selection in the fallback path of ?LAED4
  (Reference-LAPACK PR 1191)
- Rounding up of LWORK and removal of redundant type conversions in the GVD
  functions (Reference-LAPACK PR 1202)
- internal errors were getting ignored in calculation of selected eigenvalues
  (Reference-LAPACK PR 1204)
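
The LWORK item above concerns workspace sizes that are returned as floating-point numbers. The sketch below shows the underlying problem in single precision and one way to round up, following the idea of multiplying by one plus epsilon when the conversion lost precision. The function name `roundup_lwork` is hypothetical, and float32 is emulated with `struct` because Python floats are doubles.

```python
import struct

F32_EPSILON = 2.0 ** -23  # spacing between 1.0 and the next float32


def to_f32(value: float) -> float:
    """Round a Python float (double) to the nearest single-precision value."""
    return struct.unpack("<f", struct.pack("<f", value))[0]


def roundup_lwork(lwork: int) -> float:
    """Return a float32 workspace size that is never smaller than lwork."""
    as_single = to_f32(float(lwork))
    if int(as_single) < lwork:
        # The conversion rounded down, so nudge the value up by one epsilon.
        as_single = to_f32(as_single * (1.0 + F32_EPSILON))
    return as_single


size = 2 ** 24 + 1                 # smallest positive integer float32 cannot hold
print(int(to_f32(float(size))))    # plain conversion comes out one too small
print(int(roundup_lwork(size)))    # rounded-up value is large enough again
```

A caller that allocates `int(returned_value)` elements would under-allocate by one with the plain conversion, which is the kind of error rounding up avoids.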

arm64:

Fixed a potential miscompilation of the SDOT/DDOT/DSDOT kernels
Fixed DYNAMIC_ARCH compilation with CMake and compilers lacking SVE support
Improved the performance of BGEMM and SBGEMM kernels for Neoverse V2
Added optimized SSUM and DSUM kernels for Neoverse N1
Added preliminary support for Neoverse V3 cpus as NEOVERSEV2
Added cpu autodetection of Cortex A725 and X925 cpus
Fixed a CMake build problem with flang on Mac OS
Fixed build problems with gcc versions 12 and earlier that do not support fp16
Fixed compilation of GEMM kernels for VORTEXM4/ARMV9SME without multithreading
Fixed the optimized CDOT/ZDOT kernel to compile with LLVM under Windows on Arm
Renamed the copy of the DllMain function used in static linking on MS Windows to
OpenBLASDllMain to avoid symbol name conflicts with other libraries

loongarch64:

fixed POTRF returning wrong results on LA464 due to a wrong parameter setting

power:

Fixed compilation problems caused by missing support for half-precision floats (FP16)
Fixed a potential miscompilation of the POWER10 DGEMV kernel by limiting its optimization
level
Fixed a SCAL issue on PPCG4/PPC970 running Linux
Worked around a SCAL issue on PPC970 running FreeBSD by switching to the generic C kernels

riscv64:

Optimized the CROT/ZROT kernel for vector length 128 in the non-unit stride path
Improved SBGEMM/SHGEMM and related helper functions for type conversion
Fixed probing for BFLOAT16 support in DYNAMIC_ARCH cpu detection at runtime

x86_64:

Fixed a potential miscompilation (by gcc 15.x) of the AVX512 SGEMM kernel for "small"
matrix sizes
Fixed the SROT and DROT kernels for Haswell to have consistent (FMA) rounding
in the main loop and tail call
Added automatic detection of Intel Arrow Lake H/U, Panther Lake and Jasper Lake
Added automatic detection of Intel Emerald Rapids and upcoming cpu models
Updated the cache size translation table in the cpu model autodetection code
Improved cpu detection fallback to also include Nehalem as a non-AVX option
Fixed a Makefile build issue with clang and the SkylakeX SGEMM kernel
Renamed the copy of the DllMain function used in static linking on MS Windows to
OpenBLASDllMain to avoid symbol name conflicts with other libraries

wasm:

Added optimized intrinsics kernels for SGEMM and DGEMM as well as DOT, ROT and SUM

md5sums:
c2e1ba0 OpenBLAS-0.3.32.zip
021eb76 OpenBLAS-0.3.32.tar.gz
8ca8a65 OpenBLAS-0.3.32-x86.zip
859ed95 OpenBLAS-0.3.32-x64.zip
7dd8a3a OpenBLAS-0.3.32-x64-64.zip
dd3f0c7 OpenBLAS-0.3.32-woa64-dll.zip
b0d4512 OpenBLAS-0.3.32-woa64-static.zip
e97d9fc OpenBLAS-0.3.32-woa64-64-dll.zip
d5e7126 OpenBLAS-0.3.32-woa64-64-static.zip

`v0.3.31`: OpenBLAS 0.3.31 version

general:

reverted a matrix partitioning optimization from 0.3.30 that could lead to
race conditions and subsequent invalid results in GEMM
added the bfloat16 extensions BGEMM and BGEMV
added a BLAS interface for the ?GEMM_BATCH extensions
added the BLAS extensions ?GEMM_BATCH_STRIDED and their CBLAS interface
added the basic infrastructure for half-precision float (FP16) format
using SH prefix
reimplemented the LAPACK SLAED3/DLAED3 function using multithreading, thereby
improving the performance of the SSYEVD/DSYEVD eigensolver for symmetric matrices
on all platforms
limited the number of retries for initial memory allocation to avoid infinite
hanging on low-memory systems
fixed a thread lockup situation encountered with python 3.9 or older and numpy
introduced a problem size threshold for multithreading in STRMV/DTRMV
introduced a problem size threshold for multithreading in CHER/CHER2/CHPR/CHPR2
and ZHER/ZHER2/ZHPR/ZHPR2
improved the problem size thresholds for multithreading in SGER/DGER
improved autodetection of the Fortran compiler
fixed passing of the INTERFACE64=1 option to the flang-new compiler
fixed a potential deadlock in multithreaded code after calling fork()
fixed builds using CMake on FreeBSD
fixed builds using CMake from within Cygwin on Windows
fixed builds using CMake and the NVHPC compiler on ARM64
fixed CMake build error from misdetecting compiler or OpenMP versions
improved contents of the CMake-generated OpenBLASConfig.cmake file
added support for cross-compilation to RISCV targets via CMake
fixed cross-compilation to x86 targets from non-x86 architectures
fixed failure to install cblas.h if NO_CBLAS=0 was specified
fixed missing user-defined pre- and postfixes on functions in lapack.h,lapacke.h
included fixes from the Reference-LAPACK project:
- fix ordering bug in ?LAED/?LASD (Reference-LAPACK PR 1140)
- revert changes in ?GEEV from PR 1129 (Reference-LAPACK PR 1142)
- fix workspace allocation in LAPACKE_?TRSEN (Reference-LAPACK PR 1144)
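
To make the `?GEMM_BATCH_STRIDED` item above more concrete, the following pure-Python sketch shows what a strided batched GEMM computes conceptually: one `C_i = alpha * A_i * B_i + beta * C_i` per batch entry, with the matrices stored at fixed offsets in flat buffers. The function name, argument order and row-major layout are assumptions made for illustration and do not reproduce the OpenBLAS interface, which also takes transposition flags and leading dimensions.

```python
def gemm_batch_strided(m, n, k, alpha, a, stride_a, b, stride_b,
                       beta, c, stride_c, batch):
    """Update c in place: C_i = alpha * A_i @ B_i + beta * C_i for each batch i.

    a, b and c are flat lists holding row-major matrices; matrix i starts at
    offset i * stride in its buffer (hypothetical layout for this sketch).
    """
    for i in range(batch):
        a0, b0, c0 = i * stride_a, i * stride_b, i * stride_c
        for row in range(m):
            for col in range(n):
                acc = 0.0
                for p in range(k):
                    acc += a[a0 + row * k + p] * b[b0 + p * n + col]
                idx = c0 + row * n + col
                c[idx] = alpha * acc + beta * c[idx]


# Two 2x2 problems packed back to back: A0 times identity, identity times B1.
a = [1.0, 2.0, 3.0, 4.0, 1.0, 0.0, 0.0, 1.0]
b = [1.0, 0.0, 0.0, 1.0, 5.0, 6.0, 7.0, 8.0]
c = [0.0] * 8
gemm_batch_strided(2, 2, 2, 1.0, a, 4, b, 4, 0.0, c, 4, 2)
print(c)
```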

riscv:

added optimized SBGEMM kernels for ZVL128B and ZVL256B targets
added optimized SHGEMM kernels for ZVL128B and ZVL256B targets
added optimized SBGEMV and SHGEMV kernels for ZVL128B/ZVL256B
improved performance of the GEMV kernel for ZVL256B
improved the performance of the CROT and ZROT kernels for ZVL128B and x280
improved the detection of RVV1.0 capability
improved performance of the matrix packing helper functions for ZVL128B and ZVL256B
improved performance of OMATCOPY for ZVL128B and ZVL256B

arm:

fixed spurious executable stack in the getarch utility

arm64:

fixed spurious executable stack in the getarch utility
fixed compiler warnings arising from the timer macro RPCC
fixed cache size detection for Qualcomm Oryon under Windows on Arm
fixed argument handling in the default SVE kernel for SDOT/DDOT
building the BFLOAT16 kernels is now enabled by default
improved the overall performance of GEMM,SYMM and HEMM on A64FX
improved the performance of SDOT/DDOT on A64FX
improved the multithreading performance of SDOT/DDOT on A64FX by
introduction of a throttling table matching thread count to problem size
improved the performance of SGER/DGER on A64FX and NEOVERSEV1
improved the multithreading performance of GEMM on A64FX and NEOVERSEV1
improved the performance of the GEMV kernel for SVE-capable targets
improved the multithreading performance of SGEMM on NEOVERSEV1 and V2
added optimized SAXPY/DAXPY SVE kernels for A64FX and NEOVERSEV1
added optimized BGEMM and BGEMV kernels for NEOVERSEV1
added an optimized BGEMM kernel for NEOVERSEN2
added support for the NEOVERSEV2 cpu
added dedicated support for the Apple M4 cpu as VORTEXM4
added optimized SGEMM/SSYMM/STRMM/SSYRK/SSYR2K for SME-capable targets
(ARMV9SME and VORTEXM4)
improved the precision of the SNRM2 kernel
added cpu autodetection and compiler settings for Ampere One processors
fixed cpu autodetection for Apple M systems running Linux
fixed building on MacOS with AppleClang,gfortran and xcode v16 or newer
fixed several errors in the C code replacements for the complex and double
precision complex LAPACK functions that get used (only) when compiling with
Microsoft C and NOFORTRAN=1 under MS Windows

power:

added initial support for the POWER11 architecture
improved performance of DGEMM and DGEMV on POWER10
fixed the default compiler flags to use "-O3" instead of the possibly unsafe
"-Ofast"
fixed building under MacOS (for old G4 Macs) with CMake
fixed potential miscompilation of DGEMV and other assembly kernels by gcc15.1
fixed compilation with recent versions of flang

loongarch64:

fixed warnings and potential inaccuracies arising from incorrect saving of registers
fixed enumeration of logical cores on big NUMA servers
fixed building with LLVM and the INTERFACE64=1 option

x86:

fixed building the GEMM3M kernels for the GENERIC target
fixed several errors in the C code replacements for the complex and double
precision complex LAPACK functions that get used (only) when compiling with
Microsoft C and NOFORTRAN=1 under MS Windows

x86_64:

added cpu autodetection for Intel Lunar Lake (Core Ultra 200V)
changed all ?MIN and ?MAX assembly kernels to use unaligned operations
fixed several errors in the C code replacements for the complex and double
precision complex LAPACK functions that get used (only) when compiling with
Microsoft C and NOFORTRAN=1 under MS Windows
fixed potential crashes in builds for Cooper Lake, Sapphire Rapids or Zen5 cpus
under MS Windows

zarch:

added support for building with CMake

sparc:

fixed a potential crash in the DNRM2 kernel

md5sums:
0505027 OpenBLAS-0.3.31.tar.gz
5480a90 OpenBLAS-0.3.31.zip
e9a7262 OpenBLAS-0.3.31-x86.zip
c6d0e83 OpenBLAS-0.3.31-x64.zip
437f0c0 OpenBLAS-0.3.31-x64-64.zip
a0c1f8b OpenBLAS-0.3.31-woa64-static.zip
fb16c99 OpenBLAS-0.3.31-woa64-dll.zip
53d3bb3 OpenBLAS-0.3.31-woa64-64-static.zip
27474c9 OpenBLAS-0.3.31-woa64-64-dll.zip

`v0.3.30`: OpenBLAS 0.3.30 version

general:

fixed an installation problem with the thread safety test in gmake builds
fixed spurious overwriting of an input array in complex GEMMT/GEMMTR
fixed naming of GEMMTR in error messages from XERBLA
fixed compilation of SBGEMMT/SBGEMMTR in CMake builds
fixed the implementation of ?NRM2 to handle INCX=0 correctly
removed tests for CSROT and ZDROT that relied on unspecified behavior
fixed a performance regression in multithreaded GEMM that was particularly
serious on POWER targets
fixed linking issues when using LLVM's flang-new with gmake
fixed a potential thread safety problem with C11 atomic operations
further improved the workload partitioning in parallel GEMM
fixed omission of LAPACKE interfaces for CGESVDQ,CTRSYL3 and ?GEQPF in
CMake builds
fixed mishandling of setting NO_LAPACK to FALSE, and incorrect dependencies
for LAPACK function SPMV in CMake builds
added explicit CMake options for building LAPACKE and shared libraries
simplified and improved handling of OpenMP options in CMake builds
reworked Windows DLL generation in CMake builds to ensure correct symbol
renaming (pre/postfixing) and optional generation of PDB files for debugging
updated the Perl script version of the gensymbol utility for use with
Windows-on-Arm
Fixed building with (Mingw) gmake on Windows to ensure completeness of the
LAPACK included in the static library (potential race condition due to the
Windows version of the "ln" utility creating snapshot copies rather than links)
fixed unwanted deletion of the lapacke_mangling.h file by "make clean"
fixed potential duplication of a _64 suffix on library names in CMake builds
fixed compilation of the C fallback copies of the LAPACK code with GCC 15
included fixes from the Reference-LAPACK project:
- fixed a truncated error message in the EIG part of the testsuite
  (Reference-LAPACK PR 1119)
- fixed too strict check in LAPACKE_?gesdd_work (PR #1126)
- fixed memory corruption when calling ?GEEV with non-finite data (PR #1128)
- fixed missing initialization of a variable in C/GEQP3RK (PR #1131)
- fixed 2nd dimension chosen in C/ZUNMLQ transposition operation (PR #1135)

x86_64:

fixed an error in the SBGEMV kernel for Cooper Lake/Sapphire Rapids
fixed corner cases of NAN and INF input handling in CSCAL and ZSCAL
improved the compiler identification code for flang-new
fixed a potential build issue in the ZSUM kernel
fixed "argument list too long" errors when building on MacOS
added cpu autodetection support for several new Arrow Lake models
fixed conditional inclusion of the fast path SGEMM kernel in DYNAMIC_ARCH
fixed compilation with the MinGW build of GCC 15

x86:

fixed corner cases of NAN and INF input handling in CSCAL and ZSCAL
worked around potential miscompilation of CDOT with very old binutils

arm64:

fixed cpu type detection of A64FX and some ThunderX models (broken in 0.3.29)
added support for the AmpereOne/1A cpus in DYNAMIC_ARCH builds
added an optimized SBGEMM kernel for NEOVERSEV1
improved 1xN SBGEMM performance by forwarding to SBGEMV
introduced a stepwise increase of the thread count used for
SGEMM and SGEMV on NEOVERSEV1/V2 in relation to problem size
introduced a stepwise increase of the thread count used for
DGEMV on NEOVERSEV1 in relation to problem size
introduced a stepwise increase of the thread count used for
SDOT and DDOT on NEOVERSEV1 in relation to problem size
worked around assembler limitations in LLVM for Windows-on-Arm
enabled cpu type autodetection from the registry on Windows-on-Arm
improved multithreading threshold for GEMV and GESV on Windows-on-Arm
fixed overoptimization issues with LLVM's flang in Windows-on-Arm
fixed corner cases of NAN and INF input handling in CSCAL and ZSCAL
added a fast path SGEMM kernel for small workloads on SME capable targets
improved performance of SGEMM and DGEMM kernels for small workloads
improved performance of SGEMV and DGEMV on SVE-capable targets
improved performance of SGEMV on NEOVERSEN1 and Apple M
added optimized SSYMV and DSYMV kernels for NEOVERSEN1, Apple M and all
SVE capable targets
added optimized SBGEMV kernels for NEOVERSEV1/V2/N2
improved performance of SGEMM through faster NCOPY kernels
added compiler options for the NVIDIA HPC Compiler Suite
fixed compilation on OSX with XCode 16.3 and later
fixed cpu core type and cache size detection on Apple M4
updated GEMM parameter settings for Neoverse cpus in cross-builds with CMake
fixed default compiler options for NEOVERSEN1 and CORTEXX2 in CMake builds
fixed conditional inclusion of the fast path SGEMM kernel in DYNAMIC_ARCH
fixed potential miscompilation of the non-SVE SDOT kernel
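
Several arm64 items above describe a stepwise increase of the thread count in relation to problem size. A minimal sketch of that idea is shown below. The thresholds and thread counts are invented purely for illustration and are not the values OpenBLAS uses; `pick_threads` is a hypothetical name.

```python
# Hypothetical (size limit, thread count) steps; the real tables differ per kernel.
STEPS = ((10_000, 1), (100_000, 2), (1_000_000, 4))


def pick_threads(problem_size: int, max_threads: int, steps=STEPS) -> int:
    """Use few threads for small problems and all of them only for large ones."""
    for limit, threads in steps:
        if problem_size < limit:
            return min(threads, max_threads)
    return max_threads


for size in (500, 50_000, 5_000_000):
    print(size, pick_threads(size, max_threads=16))
```

The point of such a table is that thread start-up and synchronisation costs can outweigh the gain from parallelism when the workload is small.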

arm:

fixed corner cases of NAN and INF input handling in CSCAL and ZSCAL
fixed unwanted generation of object files with a writable stack

riscv64:

added optimized SROTM and DROTM kernels for x280
fixed corner cases of NAN and INF input handling in CSCAL and ZSCAL
improved performance of GEMM_TCOPY on RVV1.0 targets with
VLEN of 128 or 256
improved performance of OMATCOPY on targets with VLEN 256
greatly improved performance of SGEMV/DGEMV
improved performance of CGEMV and ZGEMV on C910V and all RVV targets
with VLEN 256
improved performance of SAXPBY and DAXPBY on C910V and all RVV targets
with VLEN 256
improved performance of AXPY and DOT on C910V and ZVL256B targets by
falling back to non-vectorized code for very small N. (Thereby fixing
poor performance of CHBMV/ZHBMV for very small K)
fixed CMake build failures of the TRMM kernels

loongarch64:

improved performance of the LSX versions of SSYMV/DSYMV
made the LASX versions of the DSYMV and SSYMV kernels
compatible with hardware changes in LA664 and future targets
fixed inaccuracies in several LASX kernels
improved compatibility of LSX kernels with LA264 targets
fixed handling of deprecated target names in CMake builds
fixed corner cases of NAN and INF input handling in CSCAL and ZSCAL

power:

fixed building for PPCG4 with CMake
fixed SSCAL/DSCAL on PPC970 running FreeBSD
fixed a potential alignment issue in the POWER8 SGEMV kernel
fixed corner cases of NAN and INF input handling in CSCAL and ZSCAL

zarch:

fixed corner cases of NAN and INF input handling in CSCAL and ZSCAL
fixed unwanted generation of object files with a writable stack

sparc:

fixed corner cases of NAN and INF input handling in CSCAL and ZSCAL

alpha:

fixed build failure caused by spurious Windows-only typecasts

cell:

fixed probable build issue caused by spurious Windows-only typecasts

md5sums:
8db3d57 OpenBLAS-0.3.30.tar.gz
a6075d9 OpenBLAS-0.3.30.zip
2cd4644 OpenBLAS-0.3.30-x86.zip
35035a8 OpenBLAS-0.3.30-x64.zip
d07457f OpenBLAS-0.3.30-x64-64.zip
22a6c5f OpenBLAS-0.3.30-woa64-static.zip
63bcb67 OpenBLAS-0.3.30-woa64-dll.zip
a96489f OpenBLAS-0.3.30-woa64-64-static.zip

`v0.3.29`: OpenBLAS 0.3.29 version

general:

fixed a potential NULL pointer dereference in multithreaded builds
added function aliases for GEMMT using its new name GEMMTR adopted by Reference-BLAS
fixed a build failure when building without LAPACK_DEPRECATED functions
the minimum required CMake version for CMake-based builds was raised to 3.16.0 in order
to remove many compatibility and deprecation warnings
added more detailed CMake rules for OpenMP builds (mainly to support recent LLVM)
fixed the behavior of the recently added CBLAS_?GEMMT functions with row-major data
improved thread scaling of multithreaded SBGEMV
improved thread scaling of multithreaded TRTRI
fixed compilation of the CBLAS testsuite with gcc14 (and no Fortran compiler)
added support for option handling changes in flang-new from LLVM18 onwards
added support for recent calling conventions changes in Cray and NVIDIA compilers
added support for compilation with the NAG Fortran compiler
fixed placement of the -fopenmp flag and libsuffix in the generated pkgconfig file
improved the CMakeConfig file generated by the Makefile build
fixed const-correctness of cblas_?geadd in cblas.h
fixed a potential inaccuracy in multithreaded BLAS3 calls
fixed empty implementations of get/set_affinity that print a warning in OpenMP builds
fixed function signatures for TRTRS in the converted C version of LAPACK
fixed omission of several single-precision LAPACK symbols in the shared library
improved build instructions for the provided "pybench" benchmarks
improved documentation, including added build instructions for WoA and HarmonyOS
as well as descriptions of environment variables that affect build and runtime behavior
added a separate "make install_tests" target for use with cross-compilations
integrated improvements and corrections from Reference-LAPACK:
- removed a comparison in LAPACKE ?tpmqrt that is always false (LAPACK PR 1062)
- fixed the leading dimension for B in tests for GGEV (LAPACK PR 1064)
- replaced the ?LARFT functions with a recursive implementation (LAPACK PR 1080)

arm:

fixed build with recent versions of the NDK (missing .type declaration of symbols)

arm64:

fixed a long-standing bug in the (generic) c/zgemm_beta kernel that could lead to
reads and writes outside the array bounds in some circumstances
rewrote cpu autodetection to scan all cores and return the highest performing type
improved the DGEMM performance for SVE targets and small matrix sizes
improved dimension criteria for forwarding from GEMM to GEMV kernels
added SVE kernels for ROT and SWAP
improved SVE kernels for SGEMV and DGEMV on A64FX and NEOVERSEV1
added support for using the "small matrix" kernels with CMake as well
fixed compilation on Windows on Arm
improved compile-time detection of SVE capability
added cpu autodetection and initial support for Apple M4
added support for compilation on systems running iOS
added support for compilation on NetBSD ("evbarm" architecture)
fixed NRM2 implementations for generic SVE targets and the Neoverse N2
fixed compilation for SVE-capable targets with the NVIDIA compiler

x86_64:

fixed a wrong storage size in the SBGEMV kernel for Cooper Lake
added cpu autodetection for Intel Granite Rapids
added cpu autodetection for AMD Ryzen 5 series
added optimized SOMATCOPY_CT for AVX-capable targets
fixed the fallback implementation of GEMM3M in GENERIC builds
tentatively re-enabled builds with the EXPRECISION option
worked around a miscompilation of tests with mingw32-gfortran14
added support for compilation with the Intel oneAPI 2025.0 compiler on Windows

power:

fixed multithreaded SBGEMM
fixed a CMake build problem on POWER10
improved the performance of SGEMV
added vectorized implementations of SBGEMV and support for forwarding 1xN SBGEMM to them
fixed illegal instructions and potential memory overflow in SGEMM on PPCG4
fixed handling of NaN and Inf arguments in SSCAL and DSCAL on PPC440,G4 and 970
added improved CGEMM and ZGEMM kernels for POWER10
added Makefile logic to remove all optimization flags in DEBUG builds

mips64:

fixed compilation with gcc14
fixed GEMM parameter selection for the MIPS64_GENERIC target
fixed a potential build failure when compiling with OpenMP

loongarch64:

fixed compilation for Loongson3 with recent versions of gmake
fixed a potential loss of precision in Loongson3A GEMM
fixed a potential build failure when compiling with OpenMP
added optimized SOMATCOPY for LASX-capable targets
introduced a new cpu naming scheme while retaining compatibility
added support for cross-compiling Loongarch64 targets with CMake
added support for compilation with LLVM

riscv64:

removed thread yielding overhead caused by sched_yield
replaced some non-standard intrinsics with their official names
fixed and sped up the implementations of CGEMM/ZGEMM TCOPY for vector lengths 128 and 256
improved the performance of SNRM2/DNRM2 for RVV1.0 targets
added optimized ?OMATCOPY_CN kernels for RVV1.0 targets

md5sums:
d7df286 OpenBLAS-0.3.29.zip
853a0c5 OpenBLAS-0.3.29.tar.gz
195aff9 OpenBLAS-0.3.29_x64.zip
bd44474 OpenBLAS-0.3.29_x64_64.zip
4831195 OpenBLAS-0.3.29_x86.zip

`v0.3.28`: OpenBLAS 0.3.28 version

general:

Reworked the unfinished implementation of HUGETLB from GotoBLAS
for allocating huge memory pages as buffers on suitable systems
Changed the unfinished implementation of GEMM3M for the generic
target on all architectures to at least forward to regular GEMM
Improved multithreaded GEMM performance for large non-skinny matrices
Improved BLAS3 performance on larger multicore systems through improved
parallelism
Improved performance of the initial memory allocation by reducing
locking overhead
Improved performance of GBMV at small problem sizes by introducing
a size barrier for the switch to multithreading
Added an implementation of the CBLAS_GEMM_BATCH extension
Fixed miscompilation of CAXPYC and ZAXPYC on all architectures in
CMAKE builds (error introduced in 0.3.27)
Fixed corner cases involving the handling of NAN and INFINITY
arguments in ?SCAL on all architectures
Added support for cross-compiling to WebAssembly with CMAKE (in addition
to the already present makefile support)
Fixed NAN handling and potential accuracy issues in compilations with
Intel ICX by supplying a suitable fp-model option by default
The contents of the github project wiki have been converted into
a new set of documentation included with the source code.
It is now possible to register a callback function that replaces
the built-in support for multithreading with an external backend
like TBB (openblas_set_threads_callback_function)
Fixed potential duplication of suffixes in shared library naming
Improved C compiler detection by the build system to tolerate more
naming variants for gcc builds
Fixed an unnecessary dependency of the utest on CBLAS
Fixed spurious error reports from the BLAS extensions utest
Fixed unwanted invocation of the GEMM3M tests in cross-compilation
Fixed a flaw in the makefile build that could lead to the pkgconfig
file containing an entry of UNKNOWN for the target cpu after installing
Integrated fixes from the Reference-LAPACK project:
- Fixed uninitialized variables in the LAPACK tests for ?QP3RK (PR 961)
- Fixed potential bounds error in ?UNHR_COL/?ORHR_COL (PR 1018)
- Fixed potential infinite loop in the LAPACK testsuite (PR 1024)
- Make the variable type used for hidden length arguments configurable (PR 1025)
- Fixed SYTRD workspace computation and various typos (PR 1030)
- Prevent compiler use of FMA that could increase numerical error in ?GEEVX (PR 1033)
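
The `?SCAL` item above does not spell out which corner cases were involved. The sketch below assumes the typical one: a scaling factor of zero applied to a vector that contains NaN or Inf. Under IEEE 754 arithmetic, `0 * NaN` and `0 * Inf` are both NaN, so a shortcut that simply fills the vector with zeros hides them. Both function names are hypothetical and neither is OpenBLAS code.

```python
import math


def scal_shortcut(alpha, x):
    """Variant that zero-fills when alpha == 0, hiding any NaN/Inf in x."""
    if alpha == 0.0:
        return [0.0] * len(x)
    return [alpha * value for value in x]


def scal_ieee(alpha, x):
    """Variant that always multiplies, so NaN/Inf propagate per IEEE 754."""
    return [alpha * value for value in x]


data = [1.0, math.nan, math.inf]
print(scal_shortcut(0.0, data))  # every entry becomes zero
print(scal_ieee(0.0, data))      # the NaN and Inf entries become NaN
```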

x86_64:

reverted thread management under Windows to its state before 0.3.26
due to signs of race conditions in some circumstances now under study
fixed accidental selection of the unoptimized generic SBGEMM kernel
in CMAKE builds for CooperLake and SapphireRapids targets
fixed a potential thread buffer overrun in SBSTOBF16 on small systems
fixed an accuracy issue in ZSCAL introduced in 0.3.26
fixed compilation with CMAKE and recent releases of LLVM
added support for Intel Emerald Rapids and Meteor Lake cpus
added autodetection support for the Zhaoxin KX-7000 cpu
fixed autodetection of Intel Prescott (probably broken since 0.3.19)
fixed compilation for older targets with the Yocto SDK
fixed compilation of the converter-generated C versions
of the LAPACK sources with gcc-14
improved compiler options when building with CMAKE and LLVM for
AVX512-capable targets
added support for supplying the L2 cache size via an environment
variable (OPENBLAS_L2_SIZE) in case it is not correctly reported
(as in some VM configurations)
improved the error message shown when thread creation fails on startup
fixed setting the rpath entry of the dylib in CMAKE builds on MacOS

arm:

fixed building for baremetal targets with make

arm64:

Added a fast path forwarding SGEMM and DGEMM calls with a 1xN or Mx1
matrix to the corresponding GEMV kernel
added optimized SGEMV and DGEMV kernels for A64FX
added optimized SVE kernels for small-matrix GEMM
added A64FX to the cpu list for DYNAMIC_ARCH
fixed building with support for cpu affinity
worked around accuracy problems with C/ZNRM2 on NeoverseN1 and
Apple M targets
improved GEMM performance on Neoverse V1
fixed compilation for NEOVERSEN2 with older compilers
fixed potential miscompilation of the SVE SDOT and DDOT kernels
fixed potential miscompilation of the non-SVE CDOT and ZDOT kernels
fixed a potential overflow when using very large user-defined BUFFERSIZE
fixed setting the rpath entry of the dylib in CMAKE builds on MacOS
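
The fast path mentioned at the top of this list relies on the fact that a matrix product with a single row or column is a matrix-vector product. The pure-Python sketch below shows that equivalence. It assumes the "1xN or Mx1 matrix" refers to one operand having a single row or column, which the release note leaves open, and all function names are hypothetical.

```python
def gemv(a, x):
    """y = A x for a matrix given as a list of rows."""
    return [sum(a_ij * x_j for a_ij, x_j in zip(row, x)) for row in a]


def gemm(a, b):
    """C = A B using the plain definition."""
    cols = list(zip(*b))
    return [[sum(p * q for p, q in zip(row, col)) for col in cols] for row in a]


def gemm_with_fast_path(a, b):
    """Forward degenerate shapes to gemv, otherwise fall back to gemm."""
    if len(b[0]) == 1:
        # B is K x 1, so C = A B is just A times a vector.
        return [[value] for value in gemv(a, [row[0] for row in b])]
    if len(a) == 1:
        # A is 1 x K, so C = A B equals (B^T a^T)^T.
        b_transposed = [list(col) for col in zip(*b)]
        return [gemv(b_transposed, a[0])]
    return gemm(a, b)


print(gemm_with_fast_path([[1, 2], [3, 4]], [[5], [6]]))
print(gemm_with_fast_path([[1, 2]], [[3, 4], [5, 6]]))
```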

power:

Added a fast path forwarding SGEMM and DGEMM calls with a 1xN or Mx1
matrix to the corresponding GEMV kernel
significantly improved performance of SBGEMM on POWER10
fixed compilation with OpenMP and the XLF compiler
fixed building of the BLAS extension utests under AIX
fixed building of parts of the LAPACK testsuite with XLF
fixed CSWAP/ZSWAP on big-endian POWER10 targets
fixed a performance regression in SAXPY on POWER10 with OpenXL
fixed accuracy issues in CSCAL/ZSCAL when compiled with LLVM
fixed building for POWER9 under FreeBSD
fixed a potential overflow when using very large user-defined BUFFERSIZE
fixed an accuracy issue in the POWER6 kernels for GEMM and GEMV

riscv64:

Added a fast path forwarding SGEMM and DGEMM calls with a 1xN or Mx1
matrix to the corresponding GEMV kernel
fixed building for RISCV64_GENERIC with OpenMP enabled
added DYNAMIC_ARCH support (comprising GENERIC_RISCV64 and the two
RVV 1.0 targets with vector length of 128 and 256)
worked around the ZVL128B kernels for AXPBY mishandling the special
case of zero Y increment

loongarch64:

improved GEMM performance on servers of the 3C5000 generation
improved performance and stability of DGEMM
improved GEMV and TRSM kernels for LSX and LASX vector ABIs
fixed CMAKE compilation with the INTERFACE64 option set
fixed compilation with CMAKE
worked around spurious errors flagged by the BLAS3 tests
worked around a miscompilation of the POTRS utest by gcc 14.1

mips64:

fixed ASUM and SUM kernels to accept negative step sizes in X
fixed complex GEMV kernels for MSA

md5sums:
0f54185 OpenBLAS-0.3.28.tar.gz
2b3bb81 OpenBLAS-0.3.28.zip
8000151 OpenBLAS-0.3.28-x64-64.zip
a526ff1 OpenBLAS-0.3.28-x64.zip
660158a OpenBLAS-0.3.28-x86.zip

Configuration

📅 Schedule: (UTC)

Branch creation
- At any time (no schedule defined)
Automerge
- At any time (no schedule defined)

🚦 Automerge: Disabled by config. Please merge this manually once you are satisfied.

♻ Rebasing: Whenever PR becomes conflicted, or you tick the rebase/retry checkbox.

🔕 Ignore: Close this PR and you won't be reminded about this update again.

If you want to rebase/retry this PR, check this box

This PR was generated by Mend Renovate. View the repository job log.

renovate · 2026-04-10T16:10:26Z

⚠️ Artifact update problem

Renovate failed to update an artifact related to this branch. You probably do not want to merge this PR as-is.

♻ Renovate will retry this branch, including artifacts, only when one of the following happens:

any of the package files in this branch needs updating, or
the branch becomes conflicted, or
you click the rebase/retry checkbox if found above, or
you rename this PR's title to start with "rebase!" to trigger it manually

The artifact failure details are included below:

File name: pixi.lock

ExecError: Command failed: pixi lock --no-progress --color=never --quiet
Error:   × failed to solve requirements of environment 'default' for platform 'osx-
  │ arm64'
  ├─▶   × failed to solve the environment
  │   
  ╰─▶ Cannot solve the request because of: quasielasticbayes ==0.3.0 cannot be
      installed because there are no viable options:
      └─ quasielasticbayes 0.3.0 | 0.3.0 | 0.3.0 | 0.3.0 would require
         └─ python >=3.9,<3.10.0a0, for which no candidates were found.
      libopenblas ==0.3.33 cannot be installed because there are no viable
      options:
      └─ libopenblas 0.3.33 would require
         └─ llvm-openmp >=19.1.7, for which no candidates were found.
      The following packages are incompatible
      ├─ quasielasticbayes ==0.3.0 can be installed with any of the following
      options:
      │  └─ quasielasticbayes 0.3.0 | 0.3.0 would require
      │     └─ libgfortran 5.*, which can be installed with any of the
      following options:
      │        └─ libgfortran 5.0.0.dev0 | 5.0.0.dev0 | 5.0.0.dev0 |
      5.0.0.dev0 | 5.0.0.dev0 | 5.0.0.dev0 | 5.0.0.dev0 | 5.0.0.dev0 |
      5.0.0.dev0 | 5.0.0.dev0 | 5.0.0.dev0 | 5.0.0.dev0 | 5.0.0.dev0 |
      5.0.0.dev0 | 5.0.0.dev0 | 5.0.0.dev0 | 5.0.0.dev0 | 5.0.0 | 5.0.0 |
      5.0.0 | 5.0.0 | 5.0.0 | 5.0.0 | 5.0.0 | 5.0.0 | 5.0.0 | 5.0.0 | 5.0.0 |
      5.0.0 | 5.0.0 | 5.0.0 | 5.0.0 | 5.0.0 | 5.0.0 | 5.0.0 | 5.0.0 | 5.0.0 |
      5.0.0 | 5.0.0 | 5.0.0 | 5.0.0 | 5.0.0 | 5.0.0 | 5.0.0 | 5.0.0 | 5.0.0 |
      5.0.0 | 5.0.0 | 5.0.0 | 5.0.0 | 5.0.0 | 5.0.0 | 5.0.0 | 5.0.0 | 5.0.0 |
      5.0.0 | 5.0.0 | 5.0.0 | 5.0.0 | 5.0.0
      └─ libopenblas ==0.3.33 cannot be installed because there are no viable
      options:
         └─ libopenblas 0.3.33 would require
            └─ libgfortran5 >=14.3.0, which cannot be installed because there
      are no viable options:
               └─ libgfortran5 14.3.0 | 14.3.0 | 15.1.0 | 15.1.0 | 15.2.0 |
      15.2.0 | 15.2.0 | 15.2.0 | 15.2.0 | 15.2.0 | 15.2.0 | 15.2.0 | 15.2.0
      would constrain
                  └─ libgfortran ==14.3.0, which conflicts with any
      installable versions previously reported
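
One of the conflicts in the solver output above appears to come down to two incompatible pins on `libgfortran`: `quasielasticbayes 0.3.0` asks for `5.*`, while the `libgfortran5` build required by `libopenblas 0.3.33` is reported as constraining `libgfortran ==14.3.0`. A minimal sketch of that check follows, assuming a deliberately simplified matcher; the helper `satisfies` is hypothetical and is not how pixi or the conda solver evaluates specs.

```python
def satisfies(version, spec):
    """Check a version string against a simplified spec: 'X.*' or '==X.Y.Z'."""
    if spec.endswith(".*"):
        prefix = spec[:-2]
        return version == prefix or version.startswith(prefix + ".")
    if spec.startswith("=="):
        return version == spec[2:]
    raise ValueError(f"unsupported spec: {spec}")


# Constraints on libgfortran as reported in the solver output.
constraints = {
    "quasielasticbayes 0.3.0": "5.*",
    "libgfortran5 14.3.0 (required via libopenblas 0.3.33)": "==14.3.0",
}

# Candidate libgfortran versions named in the log.
candidates = ["5.0.0", "14.3.0"]

# A candidate is usable only if it passes every constraint at once.
compatible = [v for v in candidates
              if all(satisfies(v, spec) for spec in constraints.values())]
print(compatible)
```

No candidate passes both constraints, which is consistent with the solver reporting no viable options. The log also lists separate problems (the `python >=3.9,<3.10.0a0` and `llvm-openmp >=19.1.7` requirements) that this sketch does not model.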

MialLewis force-pushed the main branch from 9a3453f to 7b9b70e Compare April 22, 2026 09:35

renovate Bot force-pushed the renovate/libopenblas-0.x branch from 3e62a0f to 69f3ba4 Compare April 22, 2026 09:37

MialLewis force-pushed the main branch from 88ea88d to b5177cc Compare April 24, 2026 14:32

renovate Bot force-pushed the renovate/libopenblas-0.x branch from 69f3ba4 to fb539c9 Compare April 24, 2026 14:35

renovate Bot changed the title ~~Update dependency libopenblas to v0.3.32~~ Update dependency libopenblas to v0.3.33 Apr 24, 2026

MialLewis force-pushed the main branch 2 times, most recently from ea79c48 to 5061393 Compare April 29, 2026 12:44

renovate Bot force-pushed the renovate/libopenblas-0.x branch from fb539c9 to 75f6902 Compare April 29, 2026 12:47

MialLewis force-pushed the main branch from 4867bc7 to 14fc45d Compare May 11, 2026 09:44

Update dependency libopenblas to v0.3.33

55b1482

renovate Bot force-pushed the renovate/libopenblas-0.x branch from 75f6902 to 55b1482 Compare May 11, 2026 09:46

renovate[bot] wants to merge 1 commit into main from renovate/libopenblas-0.x


