[Merged by Bors] - atx syncer that persists results #5599

dshulyak · 2024-02-24T11:30:23Z

part of: #5553

when requested we ask configured number of peers for epoch info (collection of atxs from that epoch). on a successful response we save known ids, and will ask again only in 30 minutes (configurable). also on restart we check persisted data, and potentially avoiding eager queries, if last query was made close to the epoch end.

concurrently with requesting epoch info updates, we will download atxs from peers. download is scheduled in batches, so that we can report progress. if peer advertised invalid atx id, we will evict such id after reaching max number of retries (20 in the pr).

to make error checking possible i extended errors emitted by p2p/server and fetcher.

codecov · 2024-03-05T08:36:29Z

Codecov Report

Attention: Patch coverage is 85.11327% with 46 lines in your changes are missing coverage. Please review.

Project coverage is 79.8%. Comparing base (3e59df3) to head (11d1a45).

Files	Patch %	Lines
syncer/atxsync/syncer.go	89.5%	12 Missing and 8 partials ⚠️
sql/atxsync/atxsync.go	80.5%	6 Missing and 7 partials ⚠️
fetch/mesh_data.go	68.0%	8 Missing ⚠️
checkpoint/recovery.go	60.0%	1 Missing and 1 partial ⚠️
log/zap.go	60.0%	1 Missing and 1 partial ⚠️
syncer/syncer.go	83.3%	1 Missing ⚠️

Additional details and impacted files

@@           Coverage Diff            @@
##           develop   #5599    +/-   ##
========================================
  Coverage     79.7%   79.8%            
========================================
  Files          274     276     +2     
  Lines        27883   28180   +297     
========================================
+ Hits         22228   22490   +262     
- Misses        4108    4128    +20     
- Partials      1547    1562    +15

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

syncer/atxsync/syncer.go

poszu · 2024-03-06T13:56:59Z

syncer/atxsync/syncer.go

+			}
+		}
+
+		for atx, requests := range state {


Am I missing something or requests will always be 0?

it is updated in code below

if errors.As(err, &batchError) { for hash, err := range batchError.Errors { if errors.Is(err, server.ErrPeerResponseFailed) { state[types.ATXID(hash)]++ } else if errors.Is(err, pubsub.ErrValidationReject) { state[types.ATXID(hash)] = s.cfg.RequestsLimit } } }

syncer/atxsync/syncer.go

dshulyak · 2024-03-07T05:02:55Z

@ivan4th @poszu thanks for review, i addressed your comments

dshulyak · 2024-03-07T11:33:28Z

bors try

spacemesh-bors · 2024-03-07T11:55:09Z

try

Build failed:

ci-status

dshulyak · 2024-03-07T12:49:10Z

need to understand what changes wrt retries

…ith errors

dshulyak · 2024-03-07T13:14:00Z

bors try

spacemesh-bors · 2024-03-07T14:07:45Z

try

Build succeeded:

dshulyak · 2024-03-07T14:15:36Z

bors try

spacemesh-bors · 2024-03-07T15:07:37Z

try

Build succeeded:

dshulyak · 2024-03-07T15:11:02Z

bors try

spacemesh-bors · 2024-03-07T15:58:47Z

try

Build succeeded:

dshulyak · 2024-03-07T17:22:15Z

bors merge

part of: #5553 when requested we ask configured number of peers for epoch info (collection of atxs from that epoch). on a successful response we save known ids, and will ask again only in 30 minutes (configurable). also on restart we check persisted data, and potentially avoiding eager queries, if last query was made close to the epoch end. concurrently with requesting epoch info updates, we will download atxs from peers. download is scheduled in batches, so that we can report progress. if peer advertised invalid atx id, we will evict such id after reaching max number of retries (20 in the pr). to make error checking possible i extended errors emitted by p2p/server and fetcher.

spacemesh-bors · 2024-03-07T17:44:54Z

Build failed:

ci-status

dshulyak · 2024-03-07T17:53:49Z

bors cancel

#5652

dshulyak · 2024-03-07T17:53:56Z

bors merge

part of: #5553 when requested we ask configured number of peers for epoch info (collection of atxs from that epoch). on a successful response we save known ids, and will ask again only in 30 minutes (configurable). also on restart we check persisted data, and potentially avoiding eager queries, if last query was made close to the epoch end. concurrently with requesting epoch info updates, we will download atxs from peers. download is scheduled in batches, so that we can report progress. if peer advertised invalid atx id, we will evict such id after reaching max number of retries (20 in the pr). to make error checking possible i extended errors emitted by p2p/server and fetcher.

dshulyak · 2024-03-07T18:24:06Z

bors cancel

spacemesh-bors · 2024-03-07T18:24:09Z

Canceled.

dshulyak · 2024-03-07T18:26:13Z

bors merge

part of: #5553 when requested we ask configured number of peers for epoch info (collection of atxs from that epoch). on a successful response we save known ids, and will ask again only in 30 minutes (configurable). also on restart we check persisted data, and potentially avoiding eager queries, if last query was made close to the epoch end. concurrently with requesting epoch info updates, we will download atxs from peers. download is scheduled in batches, so that we can report progress. if peer advertised invalid atx id, we will evict such id after reaching max number of retries (20 in the pr). to make error checking possible i extended errors emitted by p2p/server and fetcher.

spacemesh-bors · 2024-03-07T19:20:28Z

Pull request successfully merged into develop.

Build succeeded:

…edness (#5600) closes: #5553 this change is on top of #5599, and will be rebased after that one is merged the change always spawns a background worker to download atxs for the ongoing epoch. such background worker for epoch X is spawned when half of the layers of epoch X have passed. this is done so that we always do some useful work to do. such worker will be asking N peers (with a default of 2) every 30 minutes, until we are ready to spawn worker for epoch X + 1. the side effect of this change, is that we are not going to block syncedness if node has all previous atx, but not from the last epoch. so node will be able to rejoin consensus faster after restart without any risk.

xearl4 · 2024-03-12T19:57:03Z

@dshulyak could you explain why atx syncer state is persisted into local.sql and not into state.sql? do i understand it correctly, that, if we want to copy a synced blockchain state.sql from one full node to another, we'd now also have to copy the atx sync state tables from local.sql?

dshulyak · 2024-03-13T17:28:37Z

no that information is meant only as an optimization to avoid asking peers for a data every time node boots. i would suggest not to copy that, just wait a bit longer when node boots up the first time. in future versions that data will be removed completely, and peers will checking atx consistency always in background.

dshulyak mentioned this pull request Feb 27, 2024

validate ballot using weight from the local activeset #5598

Open

dshulyak force-pushed the persisted-atxsync branch from 25b30d8 to 0730bf3 Compare February 29, 2024 13:36

dshulyak mentioned this pull request Mar 1, 2024

continuously send new activations and malfeasance proofs to peers #5306

Closed

dshulyak force-pushed the persisted-atxsync branch 2 times, most recently from 2259d6d to 5d36ed9 Compare March 5, 2024 07:25

reworked atx sync that persists results incrementally

addbaa6

dshulyak force-pushed the persisted-atxsync branch from 5d36ed9 to addbaa6 Compare March 5, 2024 07:25

dshulyak marked this pull request as ready for review March 5, 2024 08:20

dshulyak requested review from fasmat, poszu and ivan4th as code owners March 5, 2024 08:20

fix error assertion

12a8e5a

rename fields to publish/target to avoid confusion

a3d060d

dshulyak mentioned this pull request Mar 6, 2024

[Merged by Bors] - download atxs from current epoch in background without blockinng syncedness #5600

Closed

poszu reviewed Mar 6, 2024

View reviewed changes

dshulyak added 3 commits March 7, 2024 05:32

fix review comments

6402c81

clear atxsync

8752391

more review comments

9e7a059

dshulyak added the area/sync label Mar 7, 2024

fix config

d9658d2

poszu approved these changes Mar 7, 2024

View reviewed changes

spacemesh-bors bot added a commit that referenced this pull request Mar 7, 2024

Try #5599:

f938a53

interruptible could make calls depending on concurrency

3a8f1b8

use correct error and remove marhshal log object as it doesn't work w…

712fde6

…ith errors

spacemesh-bors bot added a commit that referenced this pull request Mar 7, 2024

Try #5599:

b8039ef

Merge branch 'develop' into persisted-atxsync

c89e21b

spacemesh-bors bot added a commit that referenced this pull request Mar 7, 2024

Try #5599:

7e9efcb

spacemesh-bors bot added a commit that referenced this pull request Mar 7, 2024

Try #5599:

313a6d5

protect from accidental error

11d1a45

spacemesh-bors bot changed the title ~~atx syncer that persists results~~ [Merged by Bors] - atx syncer that persists results Mar 7, 2024

spacemesh-bors bot closed this Mar 7, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Merged by Bors] - atx syncer that persists results #5599

[Merged by Bors] - atx syncer that persists results #5599

dshulyak commented Feb 24, 2024 •

edited

Loading

codecov bot commented Mar 5, 2024 •

edited

Loading

poszu Mar 6, 2024

dshulyak Mar 7, 2024

dshulyak commented Mar 7, 2024

dshulyak commented Mar 7, 2024

spacemesh-bors bot commented Mar 7, 2024

dshulyak commented Mar 7, 2024

dshulyak commented Mar 7, 2024

spacemesh-bors bot commented Mar 7, 2024

dshulyak commented Mar 7, 2024

spacemesh-bors bot commented Mar 7, 2024

dshulyak commented Mar 7, 2024

spacemesh-bors bot commented Mar 7, 2024

dshulyak commented Mar 7, 2024

spacemesh-bors bot commented Mar 7, 2024

dshulyak commented Mar 7, 2024

dshulyak commented Mar 7, 2024

dshulyak commented Mar 7, 2024

spacemesh-bors bot commented Mar 7, 2024

dshulyak commented Mar 7, 2024

spacemesh-bors bot commented Mar 7, 2024

xearl4 commented Mar 12, 2024

dshulyak commented Mar 13, 2024

[Merged by Bors] - atx syncer that persists results #5599

[Merged by Bors] - atx syncer that persists results #5599

Conversation

dshulyak commented Feb 24, 2024 • edited Loading

codecov bot commented Mar 5, 2024 • edited Loading

Codecov Report

poszu Mar 6, 2024

Choose a reason for hiding this comment

dshulyak Mar 7, 2024

Choose a reason for hiding this comment

dshulyak commented Mar 7, 2024

dshulyak commented Mar 7, 2024

spacemesh-bors bot commented Mar 7, 2024

try

dshulyak commented Mar 7, 2024

dshulyak commented Mar 7, 2024

spacemesh-bors bot commented Mar 7, 2024

try

dshulyak commented Mar 7, 2024

spacemesh-bors bot commented Mar 7, 2024

try

dshulyak commented Mar 7, 2024

spacemesh-bors bot commented Mar 7, 2024

try

dshulyak commented Mar 7, 2024

spacemesh-bors bot commented Mar 7, 2024

dshulyak commented Mar 7, 2024

dshulyak commented Mar 7, 2024

dshulyak commented Mar 7, 2024

spacemesh-bors bot commented Mar 7, 2024

dshulyak commented Mar 7, 2024

spacemesh-bors bot commented Mar 7, 2024

xearl4 commented Mar 12, 2024

dshulyak commented Mar 13, 2024

dshulyak commented Feb 24, 2024 •

edited

Loading

codecov bot commented Mar 5, 2024 •

edited

Loading