[REVIEW] Improve memory scaling for low average vertex degree graphs & many GPUs #1823

seunghwak · 2021-09-16T15:29:52Z

Let V be the number of vertices, E the number of edges, and P the number of GPUs. Assuming sqrt(P) is an integer, the matrix row/column property arrays on each GPU scale as V/sqrt(P), while the storage requirement for graph edges scales as E/P. If E/V is small and P is large, the O(V/sqrt(P)) term dominates the memory requirement, and analyzing N times larger graphs requires N^2 times more GPUs; this is unacceptable. However, in this case at most E/P elements of the V/sqrt(P)-sized array are accessed, so there is no need to store all V/sqrt(P) values. Instead, we can store row/column properties as (key, value) pairs, limiting the memory requirement to the minimum of V/sqrt(P) and E/P.
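To make the scaling argument concrete, here is a minimal sketch that counts per-GPU elements under the assumptions stated above. The function name `per_gpu_elements` and the example sizes are made up for illustration; this is plain arithmetic, not cuGraph code.

```python
import math


def per_gpu_elements(num_vertices, num_edges, num_gpus):
    """Element counts per GPU for the model described in this PR.

    Hypothetical helper: assumes sqrt(P) is an integer and ignores
    constant factors and per-element byte sizes.
    """
    side = math.isqrt(num_gpus)
    if side * side != num_gpus:
        raise ValueError("this sketch assumes sqrt(P) is an integer")
    dense = num_vertices // side      # V / sqrt(P): dense row/column property array
    edges = num_edges // num_gpus     # E / P: local share of the edges
    # (key, value) storage never needs more entries than either quantity
    return {"dense": dense, "edges": edges, "kv_bound": min(dense, edges)}


# Example: 2^30 vertices, average degree 8, 1024 GPUs
sizes = per_gpu_elements(2**30, 8 * 2**30, 1024)
print(sizes)
```

With these inputs the dense property array (2^25 elements) is four times larger than the local edge share (2^23 elements), which is the regime the description is concerned with.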

This PR supports storing matrix row/column properties as (key, value) pairs if the percentage of actually accessed elements is lower than a threshold value. The code has been tested only up to 8 GPUs, and there was no clear benefit at this scale. The threshold is currently set to 0, so (key, value) pair support is never enabled; the threshold will be adjusted later, after large-scale testing.
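The threshold rule can be sketched as follows. This is an illustrative Python model only: the class `PropertyStore`, the function `use_kv_pairs`, and their parameters are hypothetical names, and the actual change is a C++ wrapper class whose interface is not shown here.

```python
def use_kv_pairs(num_accessed, local_size, threshold):
    """Pick (key, value) storage when the accessed fraction is below threshold."""
    if local_size == 0:
        return False
    # Strict comparison: a threshold of 0 never selects (key, value) storage
    return num_accessed / local_size < threshold


class PropertyStore:
    """Hypothetical row/column property container (not the cuGraph API)."""

    def __init__(self, local_size, accessed_keys, threshold, default=0):
        keys = sorted(set(accessed_keys))
        self.is_kv = use_kv_pairs(len(keys), local_size, threshold)
        if self.is_kv:
            # Sparse: one entry per accessed key only
            self.store = {k: default for k in keys}
        else:
            # Dense: one entry per element of the local range
            self.store = [default] * local_size

    def get(self, key):
        return self.store[key]

    def set(self, key, value):
        if self.is_kv and key not in self.store:
            raise KeyError(f"key {key} was not registered as accessed")
        self.store[key] = value


dense = PropertyStore(1000, [3, 7, 3], threshold=0.0)   # threshold 0: stays dense
sparse = PropertyStore(1000, [3, 7, 3], threshold=0.1)  # 2/1000 < 0.1: uses pairs
sparse.set(7, 42)
```

Because callers go through `get` and `set`, the same code works with either representation, which is presumably why the commits below introduce a wrapper before touching the primitives and algorithms.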

seunghwak · 2021-09-19T18:06:06Z

rerun tests

codecov-commenter · 2021-09-20T16:41:54Z

Codecov Report

Merging #1823 (318ceff) into branch-21.10 (bf64c2c) will increase coverage by 9.72%.
The diff coverage is n/a.

❗ Current head 318ceff differs from pull request most recent head 78cfbda. Consider uploading reports for the commit 78cfbda to get more accurate results

@@               Coverage Diff                @@
##           branch-21.10    #1823      +/-   ##
================================================
+ Coverage         59.85%   69.57%   +9.72%     
================================================
  Files                77      139      +62     
  Lines              3547     8645    +5098     
================================================
+ Hits               2123     6015    +3892     
- Misses             1424     2630    +1206

Impacted Files	Coverage Δ
python/cugraph/utilities/grmat.py
python/cugraph/link_analysis/hits.py
python/cugraph/comms/comms.py
python/cugraph/proto/__init__.py
python/cugraph/traversal/traveling_salesperson.py
python/cugraph/community/egonet.py
python/cugraph/utilities/utils.py
python/cugraph/dask/community/louvain.py
python/cugraph/cores/core_number.py
...ph/structure/graph_implementation/npartiteGraph.py
... and 206 more

Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Powered by Codecov. Last update 032f86a...78cfbda. Read the comment docs.

seunghwak · 2021-09-20T17:25:57Z

rerun tests

ChuckHastings

The description field for this PR should be updated with an actual description of the work.

BradReesWork · 2021-09-22T16:30:21Z

@gpucibot merge

seunghwak added 30 commits August 19, 2021 11:35

delete unused file

723688f

Merge branch 'branch-21.10' of github.com:rapidsai/cugraph into fea_kv_pair_part1

a1ecbd2

Merge branch 'branch-21.10' of github.com:rapidsai/cugraph into fea_kv_pair_part1

5b02cdb

update headers to support row/col input properties wrapper

aaf7bb3

update to use the wrapper

06ed6c5

resolve merge conflicts

8be0108

Merge branch 'branch-21.10' of github.com:rapidsai/cugraph into fea_kv_pair_part1

efb5e3a

Merge branch 'branch-21.10' of github.com:rapidsai/cugraph into fea_kv_pair_part1

449150d

fix MG Louvain test compile errors

32495c5

clang-format

9e4514c

add thrust utility function to convert to/from std::tuple and to emulate thrust::tuple_cat

be996b3

added a wrapper class for row/col properties

2f65f41

update prims to use the row/col properties wrapper

94717d9

update algorithms to use row/col properties wrapper

a6fec7e

resolve merge conflicts

770b424

clang-format

670d891

replace rmm::exec_policy(hanlde.get_stream()) with handle.get_thrust_policy()

e2e4b13

code refinements

a35d137

code clean-up

83d7313

clang-format

92972ed

documentation update

f39d266

bug fixes

06c3fa9

additional bug fix

6fde1c0

Merge branch 'upstream_pr1797' into fea_kv_pair_part1

2f88953

Merge branch 'branch-21.10' of github.com:rapidsai/cugraph into fea_kv_pair_part1

ab30776

MG WCC bug fix

9d25030

Merge branch 'upstream_pr1802' into fea_kv_pair_part1

8d4e0db

device lambda to struct functor

dafa4ed

Merge branch 'branch-21.10' of github.com:rapidsai/cugraph into fea_kv_pair_part1

e330c91

cleanup multi-source BFS artifacts

0734f2a

seunghwak added Graph Prims improvement Improvement / enhancement to an existing function non-breaking Non-breaking change labels Sep 16, 2021

seunghwak added this to the 21.10 milestone Sep 16, 2021

undo debug printouts

e9b850a

seunghwak added 5 commits September 17, 2021 13:45

bug fix

ec02b9d

bug fix

82b6f75

fix compiler warning

2007fe1

bug fix

e264dc2

resolve merge conflicts

7871549

seunghwak changed the title ~~[WIP][skip-ci] Improve memory scaling for low average vertex degree graphs & many GPUs~~ [WIP] Improve memory scaling for low average vertex degree graphs & many GPUs Sep 19, 2021

seunghwak added 3 commits September 19, 2021 14:41

clnag-format

28e0741

adjust variable scope to free memory buffer when unnecessary

f7af95b

disable (key, value) pairs

78cfbda

seunghwak changed the title ~~[WIP] Improve memory scaling for low average vertex degree graphs & many GPUs~~ [REVIEW] Improve memory scaling for low average vertex degree graphs & many GPUs Sep 20, 2021

seunghwak added 3 - Ready for Review and removed 2 - In Progress DO NOT MERGE Hold off on merging; see PR for details labels Sep 20, 2021

BradReesWork requested review from kaatish and ChuckHastings September 21, 2021 15:07

ChuckHastings approved these changes Sep 21, 2021

rapids-bot bot merged commit 7cabcd0 into rapidsai:branch-21.10 Sep 22, 2021

seunghwak deleted the enh_mem_scaling branch October 19, 2021 21:00
