Pinned vector factory that uses the global pool #15895

vuule · 2024-05-31T01:08:31Z

Description

closes #15612
Expanded the set of vector factories to cover pinned vectors. The functions return cudf::detail::host_vector, which uses a type-erased allocator, allowing us to utilize the runtime-configurable global pinned (previously host) resource.
The pinned_host_vector type has been removed, as it can only support non-pooled pinned allocations. Its use is now replaced with cudf::detail::host_vector.
Moved the global host (now pinned) resource out of cuIO and changed the type to host_device. User-specified resources are now required to allocate device-accessible memory. The name has been changed to pinned to reflect the new requirement.
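
To illustrate the idea of a vector whose allocator is type-erased and bound to a runtime-configurable global resource, here is a minimal host-only sketch. It uses std::pmr as a stand-in and is not the cudf implementation: counting_resource, global_resource, set_global_resource, and make_vector are hypothetical names chosen for this example, and no pinned memory is involved.

```cpp
#include <cassert>
#include <cstddef>
#include <memory_resource>
#include <vector>

// Hypothetical stand-in for a user-provided "pinned" resource: it counts
// allocations and forwards the actual work to an upstream resource.
class counting_resource : public std::pmr::memory_resource {
 public:
  explicit counting_resource(std::pmr::memory_resource* upstream) : upstream_{upstream} {}
  std::size_t allocations() const { return allocations_; }

 private:
  void* do_allocate(std::size_t bytes, std::size_t alignment) override
  {
    ++allocations_;
    return upstream_->allocate(bytes, alignment);
  }
  void do_deallocate(void* p, std::size_t bytes, std::size_t alignment) override
  {
    upstream_->deallocate(p, bytes, alignment);
  }
  bool do_is_equal(std::pmr::memory_resource const& other) const noexcept override
  {
    return this == &other;
  }

  std::pmr::memory_resource* upstream_;
  std::size_t allocations_{0};
};

// Hypothetical global resource that can be swapped at run time.
inline std::pmr::memory_resource*& global_resource()
{
  static std::pmr::memory_resource* res = std::pmr::new_delete_resource();
  return res;
}

inline void set_global_resource(std::pmr::memory_resource* res) { global_resource() = res; }

// Hypothetical factory: the vector type does not depend on the concrete
// resource, because std::pmr::polymorphic_allocator erases it.
template <typename T>
std::pmr::vector<T> make_vector(std::size_t size)
{
  return std::pmr::vector<T>(size, global_resource());
}
```

Because the resource is looked up when the factory is called, swapping it with set_global_resource changes where later vectors allocate without changing their type. The resource must outlive every vector created from it.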

Checklist

I am familiar with the Contributing Guidelines.
New or existing tests cover these changes.
The documentation is up to date with these changes.

vuule · 2024-05-31T02:14:30Z

cpp/src/io/csv/reader_impl.cu

@@ -27,6 +27,7 @@
 #include "io/utilities/parsing_utils.cuh"

 #include <cudf/detail/utilities/cuda.cuh>
+#include <cudf/detail/utilities/logger.hpp>


used to be included indirectly, same as cpp/src/io/orc/reader_impl_chunking.cu

vuule · 2024-05-31T02:14:47Z

cpp/include/cudf/detail/utilities/vector_factories.hpp

@@ -380,7 +382,7 @@ thrust::host_vector<T> make_host_vector_async(device_span<T const> v, rmm::cuda_
 * @brief Asynchronously construct a `std::vector` containing a copy of data from a device
 * container
 *
- * @note This function synchronizes `stream`.
+ * @note This function does not synchronize `stream`.


PointKernel · 2024-06-04

Outside the scope of this PR: the _sync suffix in make_host_vector_sync is wordy, confusing, and inconsistent with the CUDA/C++ naming convention, in which APIs are synchronous by default and only asynchronous ones get an _async suffix. I assume this is also why the typo was introduced in the first place.

vuule · 2024-05-31T02:15:09Z

cpp/include/cudf/detail/utilities/rmm_host_vector.hpp

@@ -19,6 +19,7 @@
 #include <cudf/utilities/default_stream.hpp>
 #include <cudf/utilities/error.hpp>

+#include <rmm/aligned.hpp>


used to be included indirectly

vuule · 2024-05-31T02:19:15Z

cpp/src/io/text/data_chunk_source_factories.cpp

@@ -32,7 +33,7 @@ namespace {

 struct host_ticket {
  cudaEvent_t event;
-  cudf::detail::pinned_host_vector<char> buffer;
+  std::unique_ptr<cudf::detail::rmm_host_vector<char>> buffer;


there's an array of host_tickets, so it needs to have a default constructor
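
A minimal sketch of why wrapping the buffer in std::unique_ptr helps, assuming the vector type cannot be default-constructed (for example because it needs a size or allocator up front). The names sized_buffer, ticket, and ensure_buffer are hypothetical and only illustrate the pattern; this is not the code from the PR.

```cpp
#include <array>
#include <cassert>
#include <cstddef>
#include <memory>
#include <type_traits>
#include <vector>

// Hypothetical buffer type that requires a constructor argument.
class sized_buffer {
 public:
  explicit sized_buffer(std::size_t size) : data_(size) {}
  std::size_t size() const { return data_.size(); }

 private:
  std::vector<char> data_;
};

// Holding the buffer through unique_ptr keeps the struct default-constructible:
// the pointer simply starts out null.
struct ticket {
  int id{0};
  std::unique_ptr<sized_buffer> buffer;
};

static_assert(!std::is_default_constructible_v<sized_buffer>);
static_assert(std::is_default_constructible_v<ticket>);

// Create the buffer on first use, or replace it when it is too small.
inline sized_buffer& ensure_buffer(ticket& t, std::size_t size)
{
  if (!t.buffer || t.buffer->size() < size) { t.buffer = std::make_unique<sized_buffer>(size); }
  return *t.buffer;
}
```

With this layout an std::array<ticket, N> can be value-initialized, and each element allocates its buffer only when it is first needed.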

vuule · 2024-05-31T02:20:05Z

cpp/src/utilities/pinned_memory.cpp

moved to a separate file with plans to add other user-facing APIs related to pinned memory.
No real changes to the code, just move + rebrand to pinned

vuule · 2024-05-31T22:56:04Z

@abellina I changed the API namespace and replaced host with pinned to reflect the new requirement. I also updated the Java side to get CI passing; please review when you get a chance.

java/src/main/java/ai/rapids/cudf/PinnedMemoryPool.java

harrism

Nice work, and thanks for the design discussions!

davidwendt · 2024-06-11T17:36:38Z

cpp/src/utilities/pinned_memory.cpp

+static_assert(cuda::mr::resource_with<fixed_pinned_pool_memory_resource,
+                                      cuda::mr::device_accessible,
+                                      cuda::mr::host_accessible>,
+              "");


Should this have a message? This would be a compile-time error, right?
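
For illustration, a compile-time check of this kind with a descriptive message might look like the sketch below. It is host-only and does not use cuda::mr: the property tags, has_properties_v, and both resource types are hypothetical stand-ins, and the message text is invented for the example. Since C++17 the message argument of static_assert is optional, so an empty string adds nothing over omitting it.

```cpp
#include <cassert>
#include <type_traits>

// Hypothetical property tags standing in for host/device accessibility.
struct host_accessible {};
struct device_accessible {};

// Hypothetical trait: a resource advertises a property by deriving from its tag.
template <typename Resource, typename... Properties>
inline constexpr bool has_properties_v = (std::is_base_of_v<Properties, Resource> && ...);

struct pinned_resource : host_accessible, device_accessible {};
struct host_only_resource : host_accessible {};

// If the condition is false, compilation stops and the message is printed.
static_assert(has_properties_v<pinned_resource, host_accessible, device_accessible>,
              "Pinned resource must be accessible from both host and device");
```

Replacing pinned_resource with host_only_resource in the assertion would fail the build and print the message, which is the situation the message is meant to explain.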

vuule · 2024-06-12T00:47:14Z

/merge

vuule added 3 commits May 30, 2024 16:24

remove pinned_host_vector

eb39019

switch to host_device resource ref

24b1245

rebrand host memory resource

6c896f6

vuule added the feature request ("New feature or request") and breaking ("Breaking change") labels May 31, 2024

vuule self-assigned this May 31, 2024

github-actions bot added the libcudf ("Affects libcudf (C++/CUDA) code.") and CMake ("CMake build issue") labels May 31, 2024

style

0048c59

vuule commented May 31, 2024

vuule added 2 commits May 31, 2024 10:39

java update because breaking

1964523

Merge branch 'branch-24.08' of https://github.com/rapidsai/cudf into …

f871ca0

…fea-pinned-vector-factory

github-actions bot added the Java ("Affects Java cuDF API.") label May 31, 2024

vuule added 3 commits May 31, 2024 12:04

java fix

ac0ce9c

Merge branch 'branch-24.08' of https://github.com/rapidsai/cudf into …

b610ba3

…fea-pinned-vector-factory

move test out of io util

ab36162

vuule marked this pull request as ready for review June 3, 2024 17:13

vuule requested review from a team as code owners June 3, 2024 17:13

vuule requested review from PointKernel and davidwendt June 3, 2024 17:13

abellina self-requested a review June 3, 2024 21:33

abellina reviewed Jun 3, 2024

java/src/main/java/ai/rapids/cudf/PinnedMemoryPool.java (resolved)

Merge branch 'branch-24.08' of https://github.com/rapidsai/cudf into …

69a1bce

…fea-pinned-vector-factory

github-actions bot added the Python ("Affects Python cuDF API."), cudf.pandas ("Issues specific to cudf.pandas"), and pylibcudf ("Issues specific to the pylibcudf package") labels Jun 11, 2024

vuule changed the base branch from branch-24.08 to branch-24.06 June 11, 2024 00:28

vuule requested a review from a team as a code owner June 11, 2024 00:28

vuule changed the base branch from branch-24.06 to branch-24.08 June 11, 2024 00:29

make do without host_uvector

f312219

github-actions bot removed the Python ("Affects Python cuDF API."), cudf.pandas ("Issues specific to cudf.pandas"), and pylibcudf ("Issues specific to the pylibcudf package") labels Jun 11, 2024

missed change

7cfee0a

vuule requested a review from harrism June 11, 2024 00:43

vyasr removed request for a team June 11, 2024 00:50

style

fe4d668

vuule removed request for msarahan, mroeschke and Matt711 June 11, 2024 00:55

vuule added the 3 - Ready for Review ("Ready for review by team") label Jun 11, 2024

harrism approved these changes Jun 11, 2024

davidwendt reviewed Jun 11, 2024

davidwendt approved these changes Jun 11, 2024

static assert message

2d63f5a

vuule added the 5 - Ready to Merge ("Testing and reviews complete, ready to merge") label and removed the 3 - Ready for Review ("Ready for review by team") label Jun 11, 2024

rapids-bot bot merged commit f7ba6ab into rapidsai:branch-24.08 Jun 12, 2024
76 checks passed

Kh4ster mentioned this pull request Sep 30, 2024

[FEA]: Have a less verbose pinned memory container NVIDIA/cccl#2485

Closed

1 task
