You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Based on discussions offline with @magnatelee, it appears that cunumeric's reduction operations are relatively slow. I replaced the reduction operation of "count_non_zeros" in the Nonzero task with a thrust::transform_reduce, and found that I achieved much better performance. I also tested out removing the accessor and directly indexing the input buffers and saw little change in performance.
The text was updated successfully, but these errors were encountered:
Based on discussions offline with @magnatelee, it appears that cunumeric's reduction operations are relatively slow. I replaced the reduction operation of "count_non_zeros" in the
Nonzero
task with athrust::transform_reduce
, and found that I achieved much better performance. I also tested out removing the accessor and directly indexing the input buffers and saw little change in performance.The text was updated successfully, but these errors were encountered: