-
I am implementing a CSR2COO conversion function, similar in functionality to cusparse::csr2coo, using DeviceMemcpy::Batched with a Thrust iterator as the input buffer, but I am getting the following error:
Is such a compile-time check really necessary? The following code snippet shows my use case. The error occurs because input_buffer is a transform iterator and the conversion from
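For readers unfamiliar with the conversion, here is a minimal host-side sketch of what CSR2COO is expected to produce. It is only a CPU reference for the semantics, not the CUB-based implementation discussed in this thread; the function name `csr2coo_rows` and the choice of `std::int64_t` are assumptions made for illustration. Each row acts as one "buffer" whose source is the constant row index, which is why a fancy iterator rather than a raw pointer is the natural input.

```cpp
#include <cstddef>
#include <cstdint>
#include <vector>

// Host-side reference for CSR2COO (hypothetical helper, for illustration only).
// indptr has num_rows + 1 entries; row i owns nonzeros [indptr[i], indptr[i + 1]).
// Returns one row index per nonzero, i.e. the COO row array.
std::vector<std::int64_t> csr2coo_rows(const std::vector<std::int64_t>& indptr) {
  if (indptr.empty()) return {};
  const std::size_t num_rows = indptr.size() - 1;
  std::vector<std::int64_t> coo_rows(static_cast<std::size_t>(indptr.back()));
  for (std::size_t row = 0; row < num_rows; ++row) {
    // Conceptually one buffer per row: the source is the constant value `row`,
    // the destination starts at indptr[row], and the size is the row's degree.
    for (std::int64_t k = indptr[row]; k < indptr[row + 1]; ++k) {
      coo_rows[static_cast<std::size_t>(k)] = static_cast<std::int64_t>(row);
    }
  }
  return coo_rows;
}
```

With `indptr = {0, 2, 2, 5}` (three rows with degrees 2, 0 and 3), the result is `{0, 0, 2, 2, 2}`.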
Replies: 5 comments 1 reply
-
I am also wondering whether it is crucial for performance to use uint32_t for the number of buffers, since this code would not work for graphs with more than 2^32 vertices.
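To make the concern concrete, here is a small sketch of what happens when a 64-bit buffer count is passed through a 32-bit parameter. The helper names `fits_in_uint32` and `narrow_count` are hypothetical and not part of CUB; they only illustrate the arithmetic.

```cpp
#include <cstdint>
#include <limits>

// Hypothetical helper: true if a 64-bit buffer count can be represented as uint32_t.
bool fits_in_uint32(std::uint64_t num_buffers) {
  return num_buffers <= std::numeric_limits<std::uint32_t>::max();
}

// Narrowing an unsigned 64-bit value to 32 bits keeps it modulo 2^32,
// so counts of 2^32 or more silently wrap instead of failing.
std::uint32_t narrow_count(std::uint64_t num_buffers) {
  return static_cast<std::uint32_t>(num_buffers);
}
```

A caller with one buffer per vertex could check `fits_in_uint32(num_vertices)` up front and otherwise split the work into several calls, each covering fewer than 2^32 buffers.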
-
Given buffer sizes
-
Upon further investigation, the line making the contiguous buffer assumption is: If we could generalize the type of the source pointers there from
-
Thank you for your question and for providing more information about your use case, @mfbalin!
That said, we understand that there is demand to extend support of I think the best path forward is to specialize a few related methods, such as I have created an issue here: NVIDIA/cub#674
-
DeviceCopy::Batched was contributed some time ago.