Support for Rayon integration #33

rohitjoshi · 2018-04-19T20:45:29Z

Rayon supports parallel iterators/mapv function to process using multiple threads. How can we integrate with rayon so we can leverage both simd and thread parallel processing?

AdamNiederer · 2018-04-19T20:48:54Z

You can do something like arr.par_iter(|chunk| chunk.simd_iter(|vec| ...)) to use both multithreading and SIMD

rohitjoshi · 2018-05-07T13:27:27Z

I tried your suggestion and getting an error.
e.g.

pub fn sqrt_par_simd(a: &[f64]) -> Vec<f64> {
   a.par_iter(|chunk| {
        chunk
            .simd_iter()
            .simd_map(f64s(0.0), |index| index.sqrt())
            .scalar_collect()
    }).collect()
}

Error:

error[E0061]: this function takes 0 parameters but 1 parameter was supplied
  --> src/prior.rs:34:7
   |
34 |     a.par_iter(|chunk| {
   |       ^^^^^^^^ expected 0 parameters

error[E0277]: the trait bound `std::vec::Vec<f64>: rayon::iter::FromParallelIterator<&f64>` is not satisfied
  --> src/prior.rs:39:8
   |
39 |     }).collect()
   |        ^^^^^^^ the trait `rayon::iter::FromParallelIterator<&f64>` is not implemented for `std::vec::Vec<f64>`
   |
   = help: the following implementations were found:
             <std::vec::Vec<T> as rayon::iter::FromParallelIterator<T>>

andersk · 2018-12-31T13:32:45Z

The Rayon syntax you’re looking for is a.par_chunks(128).flat_map(|chunk| …).collect(). (Pick your favorite chunk size.)

Titaniumtown · 2021-03-24T16:39:17Z

The Rayon syntax you’re looking for is a.par_chunks(128).flat_map(|chunk| …).collect(). (Pick your favorite chunk size.)

How would that translate if I wanted to filter elements instead of map?

andersk · 2021-03-24T17:13:06Z

AFAIK Faster doesn’t currently provide a way to accelerate filter, with or without Rayon—so you’d just use the normal Rayon filter.

(If in the future some kind of filter is added to Faster, the same construction would work: you’d use Rayon’s .par_chunks().flat_map() around the hypothetical filter.)

AdamNiederer · 2021-03-25T03:07:43Z

AFAIK Faster doesn’t currently provide a way to accelerate filter, with or without Rayon—so you’d just use the normal Rayon filter.

That's correct, and it's unlikely that it will without AVX-512; SSE and AVX don't really have the underlying instructions required to yield an appreciable performance improvement, save for specific cases which wouldn't work well in a general library.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Support for Rayon integration #33

Support for Rayon integration #33

rohitjoshi commented Apr 19, 2018

AdamNiederer commented Apr 19, 2018

rohitjoshi commented May 7, 2018 •

edited

Loading

andersk commented Dec 31, 2018

Titaniumtown commented Mar 24, 2021

andersk commented Mar 24, 2021 •

edited

Loading

AdamNiederer commented Mar 25, 2021

Support for Rayon integration #33

Support for Rayon integration #33

Comments

rohitjoshi commented Apr 19, 2018

AdamNiederer commented Apr 19, 2018

rohitjoshi commented May 7, 2018 • edited Loading

andersk commented Dec 31, 2018

Titaniumtown commented Mar 24, 2021

andersk commented Mar 24, 2021 • edited Loading

AdamNiederer commented Mar 25, 2021

rohitjoshi commented May 7, 2018 •

edited

Loading

andersk commented Mar 24, 2021 •

edited

Loading