[REVIEW] Implement C/CUDA RandomWalks functionality #1439

aschaffer · 2021-03-04T16:39:36Z

This PR tracks work on issue: #1380.


codecov-io · 2021-03-10T00:08:42Z

Codecov Report

Merging #1439 (d1d4b2f) into branch-0.19 (369beee) will decrease coverage by 2.45%.
The diff coverage is n/a.

@@               Coverage Diff               @@
##           branch-0.19    #1439      +/-   ##
===============================================
- Coverage        60.72%   58.26%   -2.46%     
===============================================
  Files               70       71       +1     
  Lines             3132     3283     +151     
===============================================
+ Hits              1902     1913      +11     
- Misses            1230     1370     +140

Impacted Files	Coverage Δ
python/cugraph/utilities/utils.py	`70.76% <0.00%> (-0.89%)`	⬇️
python/cugraph/dask/common/part_utils.py	`20.66% <0.00%> (-0.18%)`	⬇️
python/cugraph/__init__.py	`100.00% <0.00%> (ø)`
python/cugraph/community/egonet.py	`91.42% <0.00%> (ø)`
python/cugraph/traversal/__init__.py	`100.00% <0.00%> (ø)`
python/cugraph/dask/community/louvain.py	`33.33% <0.00%> (ø)`
python/cugraph/tree/minimum_spanning_tree.py	`85.36% <0.00%> (ø)`
python/cugraph/dask/structure/renumber.py
python/cugraph/structure/new_number_map.py	`0.00% <0.00%> (ø)`
python/cugraph/traversal/ms_bfs.py	`11.11% <0.00%> (ø)`
... and 2 more

Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Last update c1047ed...d1d4b2f.

rlratzel · 2021-03-26T04:08:29Z

cpp/tests/experimental/rw_low_level_test.cu

+  EXPECT_EQ(v_ro, v_ro_expected);
+  EXPECT_EQ(v_ci, v_ci_expected);
+  EXPECT_EQ(v_vs, v_vs_expected);


Is the intention here to test make_graph()? If so, maybe that would be better off as a separate test to 1) clean up this test, 2) make the test name more accurate, and 3) better communicate that a failure here isn't a failure in RW.

aschaffer · 2021-03-26

No. The intention is to factor out make_graph() so that several tests can call it to make small graphs with predictable (and observable) topology.

rlratzel · 2021-03-26

I think it would be better to test make_graph() separately, then move all the repetitive inline calls to make_graph() to a SetUp().

aschaffer · 2021-03-26

I'm not testing make_graph(). It isn't doing enough work to warrant separate testing. It just provides a reproducible graph to further test RW on.

make_graph() just calls the graph_t constructor and returns a graph_t instance. This path was extensively tested when graph_t was added to our code base. Re-testing old / existing functionality doesn't bring any value; it just bloats our code base and makes tests harder to understand and functionality harder to isolate.
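
For readers following the thread: the three vectors compared in the diff above (v_ro, v_ci, v_vs) are presumably the row offsets, column indices, and values of a CSR layout. The following is a minimal host-only sketch of a helper that builds such arrays for a tiny graph with predictable topology. The names `toy_csr_t` and `make_toy_csr` are hypothetical; this is not the PR's make_graph(), which per the comment above constructs a graph_t.

```cpp
#include <cstddef>
#include <vector>

// Hypothetical host-side CSR container (not a cuGraph type).
struct toy_csr_t {
  std::vector<int> row_offsets;  // size: num_vertices + 1
  std::vector<int> col_indices;  // size: num_edges
  std::vector<float> values;     // size: num_edges
};

// Builds CSR arrays from a COO edge list with a counting sort on the source vertex.
// Assumes every src[i] and dst[i] lies in [0, num_vertices) and that
// src, dst, and w all have the same length.
inline toy_csr_t make_toy_csr(int num_vertices,
                              std::vector<int> const& src,
                              std::vector<int> const& dst,
                              std::vector<float> const& w)
{
  toy_csr_t csr;
  csr.row_offsets.assign(num_vertices + 1, 0);
  csr.col_indices.resize(src.size());
  csr.values.resize(src.size());

  // Count the out-degree of each source vertex.
  for (int s : src) { ++csr.row_offsets[s + 1]; }
  // A prefix sum turns the counts into row offsets.
  for (int v = 0; v < num_vertices; ++v) { csr.row_offsets[v + 1] += csr.row_offsets[v]; }

  // Scatter edges into place, preserving input order within each row.
  std::vector<int> next(csr.row_offsets.begin(), csr.row_offsets.end() - 1);
  for (std::size_t e = 0; e < src.size(); ++e) {
    int const pos        = next[src[e]]++;
    csr.col_indices[pos] = dst[e];
    csr.values[pos]      = w[e];
  }
  return csr;
}
```

With a helper of this shape, a test can state the expected offsets and indices by hand for a graph of three or four vertices, which is what makes the topology "observable" in the sense used above.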

ChuckHastings · 2021-03-29T15:07:34Z

cpp/include/algorithms.hpp

+           rmm::device_uvector<typename graph_t::weight_type>,
+           rmm::device_uvector<index_t>>
+random_walks(raft::handle_t const &handle,
+             graph_t const &graph,


That seems reasonable. The louvain example (which uses graph_t - changed to graph_view_t in the latest PR) that Andrei may have copied this from does this because the same API supports both experimental and legacy graph classes. Things that only support the experimental graph classes should follow the explicit paradigm, I think.
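
As a rough illustration of the two signature styles being contrasted, the sketch below uses a hypothetical `toy_graph_view_t` (not the cuGraph class) and two hypothetical functions. The first is templated on the whole graph type and reads nested type aliases from it, as the random_walks declaration in the diff above does; the second fixes the graph class and templates only its type parameters, which is one reading of the "explicit paradigm" mentioned in the comment.

```cpp
#include <cstddef>
#include <vector>

// Hypothetical stand-in for a graph view class; not the cuGraph type.
template <typename vertex_t, typename edge_t, typename weight_t>
struct toy_graph_view_t {
  using vertex_type = vertex_t;
  using edge_type   = edge_t;
  using weight_type = weight_t;

  std::vector<edge_t> offsets;  // CSR row offsets, size num_vertices + 1

  vertex_t number_of_vertices() const { return static_cast<vertex_t>(offsets.size() - 1); }
};

// Generic style: accepts any class exposing vertex_type / edge_type and an
// offsets member, so one signature can serve more than one graph class.
template <typename graph_t>
typename graph_t::edge_type out_degree_generic(graph_t const& graph,
                                               typename graph_t::vertex_type v)
{
  return graph.offsets[v + 1] - graph.offsets[v];
}

// Explicit style: the graph class is fixed; only its type parameters are templated.
template <typename vertex_t, typename edge_t, typename weight_t>
edge_t out_degree_explicit(toy_graph_view_t<vertex_t, edge_t, weight_t> const& graph, vertex_t v)
{
  return graph.offsets[v + 1] - graph.offsets[v];
}
```

The explicit form documents in the signature which graph class is supported, at the cost of having to be repeated if a second graph class ever needs the same algorithm.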

ChuckHastings · 2021-03-29T15:18:35Z

cpp/include/algorithms.hpp

+             graph_t const &graph,
+             typename graph_t::vertex_type const *ptr_d_start,
+             index_t num_paths,
+             index_t max_depth);


num_paths is only used to convert the raw pointer/size back into a vector in your implementation. That should probably be a size_t.

Do we expect max_depth to be a larger integer? Would hardcoding it to an int32_t be sufficient? The returned result is going to be O(num_paths * max_depth), which will limit the value that you can pass in for max_depth anyway.

aschaffer · 2021-03-29T17:45:19Z

Perhaps max_depth could just be int32_t, but making it a different type than num_paths would complicate the interface and potentially make it a bit confusing. A more consistent, cleaner interface would probably pay off more in the future.
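
To make the size argument in this exchange concrete, here is a small sketch of the arithmetic: the output holds at most num_paths * max_depth vertices, so that product has to fit in whatever index type addresses the result. The helper `max_path_elements` is hypothetical and not part of the PR; it takes num_paths as size_t and max_depth as int32_t, the types suggested in the review, purely for illustration.

```cpp
#include <cstddef>
#include <cstdint>
#include <limits>
#include <stdexcept>

// Hypothetical helper (not part of the PR): upper bound on the number of
// vertices stored across all paths, i.e. num_paths * max_depth, checked
// against the largest value representable by index_t.
template <typename index_t>
std::size_t max_path_elements(std::size_t num_paths, std::int32_t max_depth)
{
  if (max_depth < 0) { throw std::invalid_argument("max_depth must be non-negative"); }
  auto const depth = static_cast<std::size_t>(max_depth);
  auto const limit = static_cast<std::size_t>(std::numeric_limits<index_t>::max());
  // Compare by division so the check itself cannot overflow.
  if (depth != 0 && num_paths > limit / depth) {
    throw std::overflow_error("num_paths * max_depth does not fit in index_t");
  }
  return num_paths * depth;
}
```

For example, with a 32-bit index type, 2^30 paths of depth 4 are rejected, while 1000 paths of depth 10 yield a bound of 10000 elements.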

seunghwak

Thanks!!!

aschaffer · 2021-03-30T17:57:13Z

Thanks!!!

Thank you!

BradReesWork · 2021-03-30T18:09:29Z

@gpucibot merge

aschaffer and others added 7 commits November 30, 2020 15:00

Merge pull request #37 from rapidsai/branch-0.17

acf15bc

Merge latest release 0.17

Merge pull request #38 from rapidsai/branch-0.18

a584e0b

Merge latest branch-0.18

Merge pull request #39 from rapidsai/branch-0.18

338b2d4

Update forked branch-0.18

Merge pull request #40 from rapidsai/branch-0.18

efd7e9a

Update forked branch-0.18 from release

Merge pull request #41 from rapidsai/branch-0.19

a23ce0d

Update branch-0.19 from release

Merge pull request #42 from rapidsai/branch-0.19

22b32d8

update forked from release branch-0.19

Merge branch 'branch-0.19' of github.com:rapidsai/cugraph into fea_ext_random_walks

7a5c86f

rlratzel added the improvement, non-breaking, and feature request labels and removed the improvement label Mar 5, 2021

BradReesWork assigned aschaffer Mar 9, 2021

BradReesWork added this to the 0.19 milestone Mar 9, 2021

Added API signature.

1142afa

Algorithm implementation. Main players.

bf88414

aschaffer requested a review from a team as a code owner March 10, 2021 19:03

aschaffer added 2 commits March 10, 2021 14:53

Algorithm implementation: logic flow.

9b9d024

Added preliminary test.

b41d7c4

aschaffer mentioned this pull request Mar 11, 2021

[FEA] RandomWalk analytic - C/CUDA Code #1380

Closed

Fixed compiler errors.

d50ffc6

aschaffer requested a review from a team as a code owner March 11, 2021 17:09

aschaffer added 7 commits March 11, 2021 20:32

More implementation details on overall RW stepping.

ff1aa77

Added defragmentation functionality.

c79d868

Homogeneous random generators wrappers.

8ee4c3f

Homogeneous random generators wrappers. Raft generator.

e91f26b

Raft generator fix.

3b119d6

Added coalesced extractor utility.

a906f3f

Warning fixes. Added rnd conversion from real to int.

1a08bb9

aschaffer added 5 commits March 25, 2021 17:42

Addressed comments reviews on using raft::update_host().

e0dc56f

Addressed comment review on used type alias.

5261f41

Addressed comments reviews on using raft::update_device().

41f58e2

Addressed comments reviews on using raft::print_host_vector().

a51dd83

Addressed comment reviews on replacing all_of() with count_if().

6aa2d12

rlratzel reviewed Mar 26, 2021

aschaffer added 12 commits March 26, 2021 11:01

Factored out some RW verification functionality to be used in at least 2 different testsuites.

bc43af7

Addressed comment reviews on testing the RW results in main test fixture.

8972b97

Addressed comment review on changing a comment.

73413c4

Addressed comment review on (temporarily) removing cython code as its failures are blocking CI.

08db6ac

Addressed comment review on missing some tests.

667e814

Addressed comment review on adding FIXME comment and some minor clean-up.

6c2b12b

Addressed comment reviews on fixing dox.

9c1bbc7

Addressed comment reviews on better grouping of header includes.

edee37f

Addressed comment review on seed_t defaulting to uint64_t.

b8c5911

Addressed comment review on nPaths cleanup. Also abstracted out seeding policy to allow for seed repro testing/debugging.

d1d4b2f

Addressed comment reviews on seed propagation for test / debug repro.

da7d80c

Addressed comment reviews on re-ordering member functions for readability (start/step/stop).

3511bae

ChuckHastings reviewed Mar 29, 2021

aschaffer added 2 commits March 29, 2021 11:15

Addressed comment reviews on fixing dox.

468dc40

Removed unnecessary constraint on random_engine_t construction which was causing 10.1 compile failures.

9a386bf

BradReesWork changed the title ~~[REVIEW] Implement RandomWalks functionality~~ [REVIEW] Implement C/CUDA RandomWalks functionality Mar 30, 2021

BradReesWork approved these changes Mar 30, 2021

ChuckHastings approved these changes Mar 30, 2021

seunghwak approved these changes Mar 30, 2021

rapids-bot bot merged commit f2e5a87 into rapidsai:branch-0.19 Mar 30, 2021
