Refactors get_snapshot_storages() #3760

brooksprumo · 2024-11-23T15:10:54Z

Problem

AccountsDb::get_snapshot_storages() is due for a refactor. We can speed it up for the common case, and also simplify.

Since #3737, we now call get_snapshot_storages() every time we clean. The observation here is we'll only need about 100 storages (on average), yet the current impl of get_snapshot_storages() Arc::clone's all the storages, and then filters out the unneeded ones. We can change this to only Arc::clone the useful ones instead.

Additionally, the filter step is done in parallel. When we only have 100 storages, the parallel execution does not help. In fact, with a chunk size of 5000, we end up getting zero benefit, but do have to pay the cost of running in the thread pool.

Summary of Changes

When getting storages, only Arc::clone the ones we need
When filtering, do not use a thread pool

Results

I ran this on against mnb and saw good results.

Since get_snapshot_storages() is called in two places, I wanted to look at the perf results in both.

clean

Here, we only need ~100 storages each time. So not Arc::cloning and not using the thread pool really helps. The PR ends up running consistently, and meaningfully, faster:

branch	get storages	filter	total
master	30-50 ms	100-400 us	30-50 ms
pr v1	11-13 ms	10-30 us	11-13 ms
pr v2	n/a	n/a	7-9 ms
pr v3	n/a	n/a	6-8 ms

taking full snapshots

Full snapshots do need all the storages, so we will end up Arc::cloning almost all the storages. And it's possible the thread pool does help here. For the most part, runtimes are pretty similar. The PR does have a worse worst-case filter time.

branch	get storages	filter	total
master	30-50 ms	30-40 ms	60-90 ms
pr v1	30-50 ms	30-50 ms	60-100 ms
pr v2	n/a	n/a	40-60 ms
pr v3	n/a	n/a	40-60 ms

Overall, I think the common case of clean makes this change clearly a win. There is maybe a slight slow down for full snapshots, but that code is both infrequent, and in the background, so I don't think it matters much. Additionally, by not using a thread pool, we may reduce resource usage for the system as a whole.

jeffwashington · 2024-11-23T17:31:52Z

this is fine. But, since it is now called very clean, another alternative is to get all roots less than the cutoff and then get storages based on roots. I imagine that's faster by a lot. Or, we just leave this like it is since by the time master is rolled out we are hopefully ready to skip rewrites finally and enable ancient packing. Not sure what to do. If you're not backporting this, I'd just leave it alone I imagine.

accounts-db/src/accounts_db.rs

brooksprumo · 2024-11-24T03:12:02Z

since it is now called very clean, another alternative is to get all roots less than the cutoff and then get storages based on roots. I imagine that's faster by a lot.

Yes, this does make an improvement for both clean and snapshots. Thanks for the suggestion!

Or, we just leave this like it is since by the time master is rolled out we are hopefully ready to skip rewrites finally and enable ancient packing. Not sure what to do. If you're not backporting this, I'd just leave it alone I imagine.

My initial idea was to backport to v2.1 but not v2.0. Since we're planning to activate skipping rewrites in v2.1, I want to reduce the negative impact of getting the snapshot storages in every iteration of clean. Not backporting to v2.1 will still help with snapshots, but not with clean.

HaoranYi · 2024-11-24T17:30:21Z

looks like ci is failing ...

Not sure if it is because of the pr or ci flakey

I will approve when ci passed.

brooksprumo · 2024-11-25T17:43:49Z

looks like ci is failing ...

Not sure if it is because of the pr or ci flakey

I will approve when ci passed.

Here's some discussion from Discord on the CI issue: https://discord.com/channels/428295358100013066/560503042458517505/1310571536221995068

We'll need PR #3774 to land, and then I can rebase this PR to get the CI fix.

brooksprumo · 2024-11-25T18:49:13Z

PR #3774 was merged. I've rebased and force-pushed to pull in this fix. No code was changed.

jeffwashington

lgtm

brooksprumo · 2024-11-25T21:09:19Z

Merging! @HaoranYi, I'm interpreting #3760 (comment) as an approval. Lmk if that's wrong!

brooksprumo · 2024-11-25T21:10:42Z

@jeffwashington @HaoranYi I'd like to backport to v2.1, since 2.1 and 2.0 are the main ones that'll be impacted by cleaning old storages during clean. Any objections?

mergify · 2024-11-25T21:18:02Z

Backports to the beta branch are to be avoided unless absolutely necessary for fixing bugs, security issues, and perf regressions. Changes intended for backport should be structured such that a minimum effective diff can be committed separately from any refactoring, plumbing, cleanup, etc that are not strictly necessary to achieve the goal. Any of the latter should go only into master and ride the normal stabilization schedule. Exceptions include CI/metrics changes, CLI improvements and documentation updates on a case by case basis.

(cherry picked from commit 8c7ae80)

HaoranYi · 2024-11-25T21:59:38Z

@jeffwashington @HaoranYi I'd like to backport to v2.1, since 2.1 and 2.0 are the main ones that'll be impacted by cleaning old storages during clean. Any objections?

Fine with me. Do you happen to have the updated benchmark time after the new changes?

brooksprumo · 2024-11-25T22:15:33Z

Do you happen to have the updated benchmark time after the new changes?

The v3 numbers correspond to the code that was merged. Are those the numbers you're looking for?

HaoranYi · 2024-11-25T23:16:42Z

yeah. lgtm.

(cherry picked from commit 8c7ae80)

Refactors get_snapshot_storages() (#3760) (cherry picked from commit 8c7ae80) Co-authored-by: Brooks <[email protected]>

brooksprumo self-assigned this Nov 23, 2024

brooksprumo marked this pull request as ready for review November 23, 2024 17:08

brooksprumo requested review from jeffwashington and HaoranYi November 23, 2024 17:08

HaoranYi reviewed Nov 23, 2024

View reviewed changes

accounts-db/src/accounts_db.rs Outdated Show resolved Hide resolved

brooksprumo requested a review from HaoranYi November 24, 2024 03:35

brooksprumo added 3 commits November 25, 2024 13:33

Refactors get_snapshot_storages()

2f61692

pr: get alive roots once, and check inside get_if()

4b824ae

pr: get the max alive root from the index, and check against that

9a5b097

brooksprumo force-pushed the get-snapshot-storages branch from 063e7a5 to 9a5b097 Compare November 25, 2024 18:39

jeffwashington approved these changes Nov 25, 2024

View reviewed changes

brooksprumo merged commit 8c7ae80 into anza-xyz:master Nov 25, 2024
41 checks passed

brooksprumo deleted the get-snapshot-storages branch November 25, 2024 21:09

brooksprumo added the v2.1 Backport to v2.1 branch label Nov 25, 2024

mergify bot pushed a commit that referenced this pull request Nov 25, 2024

Refactors get_snapshot_storages() (#3760)

3976744

(cherry picked from commit 8c7ae80)

mergify bot mentioned this pull request Nov 25, 2024

v2.1: Refactors get_snapshot_storages() (backport of #3760) #3785

Merged

brooksprumo added a commit that referenced this pull request Nov 26, 2024

Refactors get_snapshot_storages() (#3760)

7d7f7af

(cherry picked from commit 8c7ae80)

brooksprumo added a commit that referenced this pull request Nov 27, 2024

Refactors get_snapshot_storages() (#3760)

9d62df6

(cherry picked from commit 8c7ae80)

brooksprumo added a commit that referenced this pull request Dec 2, 2024

Refactors get_snapshot_storages() (#3760)

71cc91e

(cherry picked from commit 8c7ae80)

brooksprumo added a commit that referenced this pull request Dec 6, 2024

v2.1: Refactors get_snapshot_storages() (backport of #3760) (#3785)

24bd7c3

Refactors get_snapshot_storages() (#3760) (cherry picked from commit 8c7ae80) Co-authored-by: Brooks <[email protected]>

KirillLykov pushed a commit that referenced this pull request Dec 9, 2024

v2.1: Refactors get_snapshot_storages() (backport of #3760) (#3785)

719c3d3

Refactors get_snapshot_storages() (#3760) (cherry picked from commit 8c7ae80) Co-authored-by: Brooks <[email protected]>

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Refactors get_snapshot_storages() #3760

Refactors get_snapshot_storages() #3760

brooksprumo commented Nov 23, 2024 •

edited

Loading

jeffwashington commented Nov 23, 2024

brooksprumo commented Nov 24, 2024 •

edited

Loading

HaoranYi commented Nov 24, 2024 •

edited

Loading

brooksprumo commented Nov 25, 2024

brooksprumo commented Nov 25, 2024

jeffwashington left a comment

brooksprumo commented Nov 25, 2024

brooksprumo commented Nov 25, 2024

mergify bot commented Nov 25, 2024

HaoranYi commented Nov 25, 2024

brooksprumo commented Nov 25, 2024 •

edited

Loading

HaoranYi commented Nov 25, 2024

Refactors get_snapshot_storages() #3760

Refactors get_snapshot_storages() #3760

Conversation

brooksprumo commented Nov 23, 2024 • edited Loading

Problem

Summary of Changes

Results

jeffwashington commented Nov 23, 2024

brooksprumo commented Nov 24, 2024 • edited Loading

HaoranYi commented Nov 24, 2024 • edited Loading

brooksprumo commented Nov 25, 2024

brooksprumo commented Nov 25, 2024

jeffwashington left a comment

Choose a reason for hiding this comment

brooksprumo commented Nov 25, 2024

brooksprumo commented Nov 25, 2024

mergify bot commented Nov 25, 2024

HaoranYi commented Nov 25, 2024

brooksprumo commented Nov 25, 2024 • edited Loading

HaoranYi commented Nov 25, 2024

brooksprumo commented Nov 23, 2024 •

edited

Loading

brooksprumo commented Nov 24, 2024 •

edited

Loading

HaoranYi commented Nov 24, 2024 •

edited

Loading

brooksprumo commented Nov 25, 2024 •

edited

Loading