Improve MG PageRank scalability #2038

seunghwak · 2022-01-26T23:40:52Z

Improve MG PageRank performance & scalability in multi-node many GPU systems

…utation (currently with the temporary mechanism to support stream priorities, eventually, rmm should be updated to support this)

…e and also seems like having an issue with 2^31 or more elements)

…raph_creation

…rtitions in parallel

…raph_creation

…g_pagerank2

…avoid malloc failure due to fragmentation with the pool allocator)

seunghwak · 2022-03-09T00:55:46Z

rerun tests

codecov-commenter · 2022-03-09T03:54:29Z

Codecov Report

❗ No coverage uploaded for pull request base (branch-22.04@93dba00). Click here to learn what that means.
The diff coverage is n/a.

@@               Coverage Diff               @@
##             branch-22.04    #2038   +/-   ##
===============================================
  Coverage                ?   73.63%           
===============================================
  Files                   ?      154           
  Lines                   ?    10327           
  Branches                ?        0           
===============================================
  Hits                    ?     7604           
  Misses                  ?     2723           
  Partials                ?        0

Continue to review full report at Codecov.

Legend - Click here to learn more
Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Powered by Codecov. Last update 93dba00...a4f6528. Read the comment docs.

aschaffer · 2022-03-11T17:42:20Z

cpp/include/cugraph/prims/copy_v_transform_reduce_in_out_nbr.cuh

+      major_tmp_buffers.reserve(num_concurrent_loops);
+      for (size_t i = 0; i < num_concurrent_loops; ++i) {
+        size_t max_size{0};
+        for (size_t j = i; j < graph_view.get_number_of_local_adj_matrix_partitions();


Why not do a max_element() here (lines 581-584)? It's more readable and possibly faster. I understand you have a stride, but you can pass a sequence and calculate the stride inside a lambda comparer, etc.

How can I pass a sequence?

Can we do this without using thrust/boost (i.e. without using counting_iterator/transform_iterator).

We can create an additional vector storing a sequence, but then, I am not sure the code will be more readable.

Could you show me the code?

You can use counting_iterator/transform_iterator on std::vector<> with thrust::host policy and, say, thrust::reduce() or thrust::maximum_element() but the counter arithmetic can be messy. You're right the resulting code would probably NOT be any more readable.

aschaffer · 2022-03-12T01:45:40Z

cpp/include/cugraph/prims/copy_v_transform_reduce_in_out_nbr.cuh

+      major_tmp_buffers.reserve(num_concurrent_loops);
+      for (size_t i = 0; i < num_concurrent_loops; ++i) {
+        size_t max_size{0};
+        for (size_t j = i; j < graph_view.get_number_of_local_adj_matrix_partitions();


You can use counting_iterator/transform_iterator on std::vector<> with thrust::host policy and, say, thrust::reduce() or thrust::maximum_element() but the counter arithmetic can be messy. You're right the resulting code would probably NOT be any more readable.

ChuckHastings · 2022-03-14T16:38:52Z

@gpucibot merge

enable multi-stream execution and overlapping communication with comp…

092cf49

…utation (currently with the temporary mechanism to support stream priorities, eventually, rmm should be updated to support this)

seunghwak added 2 - In Progress improvement Improvement / enhancement to an existing function DO NOT MERGE Hold off on merging; see PR for details non-breaking Non-breaking change labels Jan 26, 2022

seunghwak added this to the 22.04 milestone Jan 26, 2022

seunghwak self-assigned this Jan 26, 2022

seunghwak requested a review from a team as a code owner January 26, 2022 23:40

seunghwak added 20 commits January 27, 2022 10:16

update group_by_and_count to not use reduce_by_key (which is expensiv…

6102cd1

…e and also seems like having an issue with 2^31 or more elements)

add time measurements (should be undone)

5146b19

Merge branch 'upstream_pr2044' into enh_mg_pagerank2

9d31282

cosmetic updates

965f0cd

Merge branch 'branch-22.04' of github.com:rapidsai/cugraph into enh_g…

5073304

…raph_creation

improve weak scaling behavior of renumber

7c02fcf

move is_first_in_run_t to graph_utils.cuh

0744160

avoid using device lambdas

077008b

Merge branch 'upstream_pr2044' into enh_mg_pagerank2

e0608bb

fix compile errors

6a0dfa1

Merge branch 'upstream_pr2044' into enh_mg_pagerank2

f9a9635

code cleanup

41645aa

Merge branch 'upstream_pr2044' into enh_mg_pagerank2

9bc0284

update copy_v_transform_reduce_in_out_nbr to process multiple edge pa…

6b4b682

…rtitions in parallel

fix overflow bug with 2^31 or more vertices

a0b009e

Merge branch 'upstream_pr2044' into enh_mg_pagerank2

c889302

Merge branch 'branch-22.04' of github.com:rapidsai/cugraph into enh_g…

b5cb9c4

…raph_creation

delete temporary performance measurement code

9a70472

delete additional temporary performance measurement code

214ada9

remove temporary performance measurement code

3a605b5

seunghwak added 7 commits February 21, 2022 20:28

Merge branch 'branch-22.04' of github.com:rapidsai/cugraph into enh_m…

56764c4

…g_pagerank2

Merge branch 'upstream_pr2081' into enh_mg_pagerank2

c369431

resolve merge conflicts

1692a46

added temporary code to experiment performance

a34ad17

additional cut in peak memory and maximum single allocation size (to …

b1e56e7

…avoid malloc failure due to fragmentation with the pool allocator)

Merge branch 'upstream_pr2081' into enh_mg_pagerank2

885f28a

resolve merge conflicts

b4bbb9d

seunghwak changed the title ~~[WIP][skip-ci] Improve MG PageRank scalability~~ Improve MG PageRank scalability Mar 9, 2022

seunghwak added 3 - Ready for Review and removed 2 - In Progress DO NOT MERGE Hold off on merging; see PR for details labels Mar 9, 2022

seunghwak added 4 commits March 8, 2022 17:02

remove temporary experimental code

d716f6c

remove temporary experimental code

99401f3

fix formatting error

0664216

undo copyright update with the file with no change

c136d3a

seunghwak requested review from kaatish and ChuckHastings March 9, 2022 01:12

clang-format & copyright year

a4f6528

BradReesWork requested a review from aschaffer March 11, 2022 17:17

aschaffer requested changes Mar 11, 2022

View reviewed changes

ChuckHastings approved these changes Mar 11, 2022

View reviewed changes

aschaffer approved these changes Mar 12, 2022

View reviewed changes

kaatish approved these changes Mar 12, 2022

View reviewed changes

rapids-bot bot merged commit 5fe65f6 into rapidsai:branch-22.04 Mar 14, 2022

seunghwak deleted the enh_mg_pagerank2 branch August 11, 2022 23:24

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Improve MG PageRank scalability #2038

Improve MG PageRank scalability #2038

seunghwak commented Jan 26, 2022 •

edited

Loading

seunghwak commented Mar 9, 2022

codecov-commenter commented Mar 9, 2022 •

edited

Loading

aschaffer Mar 11, 2022

seunghwak Mar 11, 2022

aschaffer Mar 12, 2022

aschaffer Mar 12, 2022

ChuckHastings commented Mar 14, 2022

Improve MG PageRank scalability #2038

Improve MG PageRank scalability #2038

Conversation

seunghwak commented Jan 26, 2022 • edited Loading

seunghwak commented Mar 9, 2022

codecov-commenter commented Mar 9, 2022 • edited Loading

Codecov Report

aschaffer Mar 11, 2022

Choose a reason for hiding this comment

seunghwak Mar 11, 2022

Choose a reason for hiding this comment

aschaffer Mar 12, 2022

Choose a reason for hiding this comment

aschaffer Mar 12, 2022

Choose a reason for hiding this comment

ChuckHastings commented Mar 14, 2022

seunghwak commented Jan 26, 2022 •

edited

Loading

codecov-commenter commented Mar 9, 2022 •

edited

Loading