accelerating spatial samplers #85

SLongshaw · 2022-03-04T09:15:43Z

The spatial samplers are all currently done as single threaded function call on the CPU, there might be scope for accelerating the majority for many-core architectures given they basically all loop over a set of points.

Immediate issues:

Memory transfer - the cell list is built for each data frame, so either that or the individual "data_points" subset found from the cell list would need to be transferred - this will be costly and hard to hide, might be scope for using CUDA MPI type approach.
The calls to filter() usually work on small subsets (< 50 points) - these would need to be bundled up and run in parallel to make the most of a GPU.
Ideally a solution should be hardware agnostic, so should be focused either on low-level like OpenCL or higher-level like SYCL type approach.

SLongshaw · 2023-08-02T10:42:22Z

Work is currently underway through a funded Intel oneAPI Centre of Excellence to accelerate parts of the library that rely on linear algebra (e.g. Radial Basis) using SYCL

https://www.scd.stfc.ac.uk/Pages/STFC-oneAPI-Centre.aspx

SLongshaw added enhancement help wanted question labels Mar 4, 2022

SLongshaw self-assigned this Mar 4, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

accelerating spatial samplers #85

accelerating spatial samplers #85

SLongshaw commented Mar 4, 2022

SLongshaw commented Aug 2, 2023

accelerating spatial samplers #85

accelerating spatial samplers #85

Comments

SLongshaw commented Mar 4, 2022

SLongshaw commented Aug 2, 2023