Inspect and control the number of threads when BLAS is implemented by Apple Accelerate #136

ogrisel · 2023-03-25T15:18:40Z

Follow-up on #135.

I am not sure whether this is possible at all.

ogrisel · 2023-04-04T12:19:48Z

I have not conducted an extensive evaluation yet, but it seems that we do not suffer from oversubscription problems when calling vecLib's GEMM from within OpenMP threads (for instance, in scikit-learn's KMeans). So maybe there is some kind of automatic mechanism in Grand Central Dispatch that prevents the usual oversubscription problem we have observed with other threaded BLAS libraries.

ogrisel · 2024-01-15T13:53:58Z

At least we could detect that Accelerate is linked, even if we cannot inspect or set the number of threads.
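
A minimal sketch of what such detection could look like, assuming we enumerate the images loaded in the current process through dyld (`_dyld_image_count` and `_dyld_get_image_name`) and match on the framework path. The helper names `is_accelerate_path` and `loaded_accelerate_images` are hypothetical, and the path-substring heuristic is an assumption rather than a confirmed design:

```python
import ctypes
import ctypes.util
import sys


def is_accelerate_path(path):
    """Heuristic (assumption): Accelerate/vecLib images live under these frameworks."""
    return "Accelerate.framework" in path or "vecLib.framework" in path


def loaded_accelerate_images():
    """Return the paths of loaded images that look like Accelerate or vecLib.

    Always returns an empty list on non-macOS platforms.
    """
    if sys.platform != "darwin":
        return []

    # The dyld introspection functions are reachable through the C library.
    libc = ctypes.CDLL(ctypes.util.find_library("c"))
    libc._dyld_image_count.restype = ctypes.c_uint32
    libc._dyld_get_image_name.argtypes = [ctypes.c_uint32]
    libc._dyld_get_image_name.restype = ctypes.c_char_p

    matches = []
    for i in range(libc._dyld_image_count()):
        name = libc._dyld_get_image_name(i)
        if name is None:
            continue
        path = name.decode("utf-8", errors="replace")
        if is_accelerate_path(path):
            matches.append(path)
    return matches
```

Calling `loaded_accelerate_images()` after `import numpy` would only tell us that Accelerate is loaded in the process, not how many threads it uses.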

ogrisel · 2024-01-15T14:00:11Z

Apparently it's possible to tell vecLib to limit the number of threads it uses via an environment variable: VECLIB_MAXIMUM_THREADS.

EDIT: it does not seem to have much effect on numpy workloads (matmul & SVD) linked against Accelerate on a Mac M1 host.
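
A rough sketch of how one could check this, assuming the variable has to be set before the library is loaded, hence the fresh child process. The helper `run_with_veclib_threads` and the `BENCH` snippet are hypothetical illustrations, and `BENCH` additionally assumes numpy is installed and linked against Accelerate:

```python
import os
import subprocess
import sys


def run_with_veclib_threads(code, n_threads):
    """Run `code` in a fresh Python process with VECLIB_MAXIMUM_THREADS set.

    Returns the child's stdout with surrounding whitespace stripped.
    """
    env = dict(os.environ, VECLIB_MAXIMUM_THREADS=str(n_threads))
    result = subprocess.run(
        [sys.executable, "-c", code],
        env=env,
        capture_output=True,
        text=True,
        check=True,
    )
    return result.stdout.strip()


# Hypothetical benchmark snippet: times one large matmul in the child process.
BENCH = """
import time
import numpy as np
a = np.random.rand(2000, 2000)
t0 = time.perf_counter()
a @ a
print(time.perf_counter() - t0)
"""
```

Comparing `run_with_veclib_threads(BENCH, 1)` with `run_with_veclib_threads(BENCH, 8)` on the same machine would show whether the setting changes the timing at all.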
