Dispute-coordinator: add LRU rw cache with write batching #5434
Conversation
Commits:
- add logs and fix tests
- this works
- fix
- buffer approval voting and fix tests

Signed-off-by: Andrei Sandu <[email protected]>
self.inner.load_recent_disputes()
let read_timer = self.metrics.time_db_read_operation("recent_disputes");
let disputes = self.inner.load_recent_disputes()?;
drop(read_timer);
For a fair comparison, we should probably include the cache update?
yeah, I missed that.
Looking at it now: the question is, do we want to include the in-memory update time in the db read operation? If so, I must measure the entire time spent in all of these `Backend` fns.
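As a sketch of what measuring the whole read path could look like (the `Metrics` type, `observe_db_read`, and the simulated cache/DB here are illustrative stand-ins, not the subsystem's real API), timing both the DB read and the in-memory cache update under one timer:

```rust
use std::time::Instant;

// Hypothetical metrics sink; the real subsystem would feed a histogram.
struct Metrics;

impl Metrics {
    fn observe_db_read(&self, label: &str, elapsed_secs: f64) {
        println!("db_read[{label}] took {elapsed_secs}s");
    }
}

// Time the whole read path: the (simulated) DB read plus the in-memory
// cache update, rather than only the raw DB read.
fn load_recent_disputes_timed(metrics: &Metrics, cache: &mut Vec<u32>, db: &[u32]) -> Vec<u32> {
    let start = Instant::now();
    // 1. Read from the underlying DB (simulated by a slice here).
    let disputes: Vec<u32> = db.to_vec();
    // 2. Update the in-memory cache; included in the measured time.
    cache.clear();
    cache.extend_from_slice(&disputes);
    metrics.observe_db_read("recent_disputes", start.elapsed().as_secs_f64());
    disputes
}
```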
}
}

let entry = entry.expect("tested above; qed");
Can't we get rid of this expect, by just using pattern matching above?
It will never panic, but yeah, to play it safe I'll use a match.
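A minimal sketch of the suggested change (`Entry` and the surrounding logic are illustrative, not the dispute-coordinator's types): pattern matching handles the `None` arm explicitly, so the potential panic path disappears.

```rust
#[derive(Debug, PartialEq)]
struct Entry(u32);

fn process(entry: Option<Entry>) -> Option<u32> {
    // Instead of `let entry = entry.expect("tested above; qed");`,
    // match so the `None` case is handled without a potential panic:
    match entry {
        Some(Entry(v)) => Some(v + 1),
        None => None,
    }
}
```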
@@ -11,6 +11,8 @@ parity-scale-codec = "3.1.2"
kvdb = "0.11.0"
thiserror = "1.0.30"
lru = "0.7.5"
lru-cache = "0.1.2"
why are we using two different crates for the same thing?
lru-cache is also in maintenance mode
That's a good point. I switched to this while I was trying to refactor using references, but I don't recall the exact reason 🥲. I'll drop it.
/// The entry is in memory and was modified (needs to be persisted later).
Dirty,
/// The entry is cached and has been persisted in the DB.
Persisted,
How is `Persisted` different from `Cached`?
Why do we use a state enum instead of keeping dirty stuff separately (a la data-oriented design)?
`Persisted` is the same as `Cached`, plus the fact that we know we wrote it to disk at least once. I was hoping to expose the state transitions as metrics in the future if this cache is useful. So that's why I am using the extra `Persisted` state.
We do keep stuff separately, but that's just what we evict from the LRU cache. This leads to more code and increases the complexity, as we need to look up multiple hashmaps. I am trying to avoid this for everything else.
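The state-enum approach described above can be sketched roughly as follows (all names and the plain `HashMap` here are illustrative assumptions, not the PR's actual code): each entry carries its persistence state, a write marks it `Dirty`, and a flush drains the dirty entries as a batch of write ops while marking them `Persisted`.

```rust
use std::collections::HashMap;

// Persistence state of a cache entry (illustrative).
#[derive(Clone, Copy, Debug, PartialEq)]
#[allow(dead_code)]
enum EntryState {
    /// In memory, never written since being loaded.
    Cached,
    /// Modified in memory; needs to be persisted later.
    Dirty,
    /// Written to the DB at least once.
    Persisted,
}

struct CacheEntry<V> {
    value: V,
    state: EntryState,
}

struct Cache<V> {
    entries: HashMap<u32, CacheEntry<V>>,
}

impl<V: Clone> Cache<V> {
    fn new() -> Self {
        Self { entries: HashMap::new() }
    }

    fn write(&mut self, key: u32, value: V) {
        // Any write marks the entry dirty until the next flush.
        self.entries.insert(key, CacheEntry { value, state: EntryState::Dirty });
    }

    // Drain dirty values as the batched write ops and mark them persisted.
    fn flush(&mut self) -> Vec<(u32, V)> {
        let mut ops = Vec::new();
        for (k, e) in self.entries.iter_mut() {
            if e.state == EntryState::Dirty {
                ops.push((*k, e.value.clone()));
                e.state = EntryState::Persisted;
            }
        }
        ops
    }
}
```

One hashmap lookup per key, at the cost of branching on the state; the split-struct layout trades that for two lookups but no enum.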
I meant the following layout instead:
struct State {
    earliest_session: Option<SessionIndex>,
    recent_disputes: Option<RecentDisputes>,
    candidate_votes: LruCache<(SessionIndex, CandidateHash), CandidateVotes>,
}

struct OverlayCache {
    cached: State,
    dirty: State,
}
This would work as well, but the same applies regarding complexity. The wrapper type also enables us to observe the lifetime of cache entries in the future.
#[cfg(test)]
pub const WRITE_BACK_INTERVAL: Duration = Duration::from_secs(0);
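A zero interval under `cfg(test)` means every pass flushes immediately, which keeps tests deterministic. A minimal sketch of interval-driven write batching under that assumption (the `Batcher` type and string ops are illustrative, not the subsystem's `BackendWriteOp`):

```rust
use std::time::{Duration, Instant};

// Zero interval, as in the cfg(test) constant above: every check flushes.
const WRITE_BACK_INTERVAL: Duration = Duration::from_secs(0);

struct Batcher {
    last_flush: Instant,
    pending: Vec<String>, // queued write ops (illustrative)
}

impl Batcher {
    fn new() -> Self {
        Self { last_flush: Instant::now(), pending: Vec::new() }
    }

    fn queue(&mut self, op: String) {
        self.pending.push(op);
    }

    // Flush only once the interval has elapsed; returns the drained batch.
    fn maybe_flush(&mut self) -> Option<Vec<String>> {
        if self.last_flush.elapsed() >= WRITE_BACK_INTERVAL {
            self.last_flush = Instant::now();
            Some(std::mem::take(&mut self.pending))
        } else {
            None
        }
    }
}
```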
/// A read-only cache (LRU) which records changes and outputs them as `BackendWriteOp` which can |
I find the `read-only` part confusing.
#5359
More details: TBD.