Skip serializing blocks when persisting to db #3657

twoeths · 2022-01-22T10:13:57Z

Is your feature request related to a problem? Please describe.

A profile from contabo-17 which has low peer count shows that it takes 8% to serialize blocks due to Backfill sync:

Describe the solution you'd like

When fetching blocks from p2p, we already have binary data, we should be able to use that to persist to db without having to serialize() again

twoeths · 2022-01-22T10:18:55Z

it also takes 9% of cpu time to do hashTreeRoot()

0122_contabo_17_backfill_serialize.cpuprofile.zip

dapplion · 2022-01-22T10:25:43Z

To mitigate this issue we could slow down Backfill sync, such that only N blocks are hashed every X seconds to even out the load.

Also keeping the binary blob around to prevent re-serialization is a very nice trick easy to implement now

g11tech · 2022-01-22T11:33:59Z

To mitigate this issue we could slow down Backfill sync, such that only N blocks are hashed every X seconds to even out the load.

should we expose this as a configurable param with a hidden cli arg?

twoeths · 2022-01-24T02:43:10Z

should we expose this as a configurable param with a hidden cli arg?

year I prefer that. Also should we run this in a separate worker thread?

dapplion · 2022-02-04T05:05:50Z

Also should we run this in a separate worker thread?

Actually maybe! I think this is a completely independent process that given the initial initial conditions it does not require communication with the main thread at all. However, is our database thread safe? Can it handle multiple writes from different workers?

twoeths · 2022-02-08T08:12:34Z

there are other Backfill Sync performance issues mentioned in #3732 (comment)

g11tech · 2022-02-08T16:17:14Z

should i remove the bytes caching bit from this PR (which we can add once ssz v2 PR is in): #3669, it will save the costs of doing hashTreeRoot for parent/child relationship validation (about 8% CPU as previously profiled by @tuyennhv ).
Also one will be able to specify the batchSize on the terminal to reduce any adverse impact of backfill

dapplion assigned g11tech Jan 22, 2022

g11tech mentioned this issue Jan 26, 2022

Optimize backfill sync to efficiently use reqresp fetched block data #3669

Closed

dapplion added prio-medium Resolve this some time soon (tm). scope-performance Performance issue and ideas to improve performance. labels May 10, 2022

dapplion unassigned g11tech May 29, 2023

dapplion mentioned this issue May 29, 2023

feat: skip serializing block after fetching from network #5573

Merged

twoeths closed this as completed in #5573 Jun 19, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Skip serializing blocks when persisting to db #3657

Skip serializing blocks when persisting to db #3657

twoeths commented Jan 22, 2022

twoeths commented Jan 22, 2022

dapplion commented Jan 22, 2022 •

edited

Loading

g11tech commented Jan 22, 2022

twoeths commented Jan 24, 2022

dapplion commented Feb 4, 2022

twoeths commented Feb 8, 2022

g11tech commented Feb 8, 2022 •

edited

Loading

Skip serializing blocks when persisting to db #3657

Skip serializing blocks when persisting to db #3657

Comments

twoeths commented Jan 22, 2022

twoeths commented Jan 22, 2022

dapplion commented Jan 22, 2022 • edited Loading

g11tech commented Jan 22, 2022

twoeths commented Jan 24, 2022

dapplion commented Feb 4, 2022

twoeths commented Feb 8, 2022

g11tech commented Feb 8, 2022 • edited Loading

dapplion commented Jan 22, 2022 •

edited

Loading

g11tech commented Feb 8, 2022 •

edited

Loading