[Feature] Unordered parallel graph walk V3 #3474

ruseinov · 2023-09-08T15:39:26Z

Summary of changes

Changes introduced in this pull request:

Introduced unordered parallel graph traversal to speed up operations that don't care about the block order.

Reference issue to close (if applicable)

Work on #3314

Change checklist

I have performed a self-review of my own code,
I have made corresponding changes to the documentation,
I have added tests that prove my fix is effective or that my feature works (if possible),
I have made sure the CHANGELOG is up-to-date. All user-facing changes should be reflected in this document.

ruseinov · 2023-09-12T11:57:58Z

unordered without "has"            traversed 87.43 GiB at 368.23 MiB/s in 00:04:03
unordered with "has"               traversed 87.43 GiB at 387.18 MiB/s in 00:03:51
unordered without parallel part    traversed 13.70 GiB at 252.89 MiB/s in 00:00:55
ordered                            traversed 87.43 GiB at 214.33 MiB/s in 00:06:57
ordered only sequential part       traversed 13.70 GiB at 257.50 MiB/s in 00:00:54

src/ipld/util.rs

Co-authored-by: Josh Jones <[email protected]>

elmattic · 2023-09-13T09:19:26Z

src/ipld/util.rs

@@ -437,3 +438,259 @@ impl<DB: Blockstore, T: Iterator<Item = Tipset> + Unpin> Stream for ChainStream<
        }
    }
 }
+
+enum UnorderedTask {


Any reason to use an enum here?

It's a leftover, will remove it for good. Thanks for spotting this.

elmattic · 2023-09-13T09:21:18Z

src/ipld/util.rs

+    tipset_iter: T,
+    stateroot_limit: ChainEpoch,
+) -> UnorderedChainStream<DB, T> {
+    let (sender, receiver) = kanal::bounded(2048);


Wondering how did you pick this 2048 value? Does the value has any impact on performances or memory usage?

It's somewhat arbitrary, 2048 worked well on my machine and memory usage for 2048 items is negligible. I'll introduce a constant to be more transparent.

elmattic · 2023-09-13T09:46:53Z

src/ipld/util.rs

+        task::spawn(async move {
+            let mut handles = JoinSet::new();
+
+            for _ in 0..num_cpus::get() {


Should I observe some speedup/slowdown if I increase/decrease the number of cores here? For example testing on a calibnet snapshot, I don't see a big difference in performances. Or should I run the benchmark command on a mainnet snapshot maybe?

Calibnet is too small of a snapshot, yeah. I have tested this and this setup worked better than others.

elmattic · 2023-09-13T09:53:39Z

src/ipld/util.rs

+                            }
+                        }
+
+                        // // Process block messages.


Can remove comment here.

ruseinov added 19 commits August 31, 2023 17:02

[Feature] Unordered paraller graph walk.

93c0c75

fix typo

d068f91

more TODO

0d71883

more comments

8edc643

more comments

4ebf509

Merge branch 'main' into ru/feature/unordered-graph-traversal

18bec6e

dump code

3979db6

code dump

fcfb544

alternative

cb59869

perf optimisation

f1192a5

switch to kanal

4c23142

worker number

ff737d1

fixes

d6eb6be

move tipset iter to the main thread

48b976e

fix and optimize

e1820c8

latest version

d9dba59

cleanup

96dbb9a

Merge branch 'main' into ru/feature/u-g-t-v3

605d579

fix imports

4f95b77

ruseinov marked this pull request as ready for review September 11, 2023 16:30

ruseinov requested a review from a team as a code owner September 11, 2023 16:30

ruseinov requested review from jdjaustin and elmattic and removed request for a team September 11, 2023 16:30

ruseinov added 3 commits September 12, 2023 13:04

fm

71c1dc7

remove pin

51c5d2c

fix fmt

45d0b3d

This was referenced Sep 12, 2023

[Feature] Unordered parallel graph walk V2 #3464

Closed

[Feature] Unordered paraller graph walk. #3436

Closed

jdjaustin reviewed Sep 12, 2023

View reviewed changes

src/ipld/util.rs Outdated Show resolved Hide resolved

jdjaustin reviewed Sep 12, 2023

View reviewed changes

src/ipld/util.rs Outdated Show resolved Hide resolved

ruseinov and others added 2 commits September 12, 2023 21:15

Update src/ipld/util.rs

a27d3b7

Co-authored-by: Josh Jones <[email protected]>

fix comments

8e2ad5a

jdjaustin approved these changes Sep 13, 2023

View reviewed changes

elmattic approved these changes Sep 13, 2023

View reviewed changes

fix review comments

6bb74b6

ruseinov enabled auto-merge September 13, 2023 11:49

ruseinov disabled auto-merge September 13, 2023 11:49

ruseinov added 2 commits September 13, 2023 14:04

fix seen

9c7a445

uniform

0fc51f7

ruseinov enabled auto-merge September 13, 2023 12:04

ruseinov added this pull request to the merge queue Sep 13, 2023

Merged via the queue into main with commit f92b14e Sep 13, 2023

ruseinov deleted the ru/feature/u-g-t-v3 branch September 13, 2023 12:47

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Feature] Unordered parallel graph walk V3 #3474

[Feature] Unordered parallel graph walk V3 #3474

ruseinov commented Sep 8, 2023 •

edited

Loading

ruseinov commented Sep 12, 2023

elmattic Sep 13, 2023

ruseinov Sep 13, 2023

elmattic Sep 13, 2023

ruseinov Sep 13, 2023

elmattic Sep 13, 2023 •

edited

Loading

ruseinov Sep 13, 2023

elmattic Sep 13, 2023

[Feature] Unordered parallel graph walk V3 #3474

[Feature] Unordered parallel graph walk V3 #3474

Conversation

ruseinov commented Sep 8, 2023 • edited Loading

Summary of changes

Reference issue to close (if applicable)

Other information and links

Change checklist

ruseinov commented Sep 12, 2023

elmattic Sep 13, 2023

Choose a reason for hiding this comment

ruseinov Sep 13, 2023

Choose a reason for hiding this comment

elmattic Sep 13, 2023

Choose a reason for hiding this comment

ruseinov Sep 13, 2023

Choose a reason for hiding this comment

elmattic Sep 13, 2023 • edited Loading

Choose a reason for hiding this comment

ruseinov Sep 13, 2023

Choose a reason for hiding this comment

elmattic Sep 13, 2023

Choose a reason for hiding this comment

ruseinov commented Sep 8, 2023 •

edited

Loading

elmattic Sep 13, 2023 •

edited

Loading