exp/lighthorizon/services: Fetch ledgers in parallel to them being processed. #4500

Shaptic · 2022-08-02T18:55:19Z

What

This modifies the ledger download+processing code to download subsets of the entire checkpoint range in parallel. Preliminary testing suggests that this can massively reduce request latency.

Before:

"GET http://localhost:8080/accounts/GBBJPGXTNNYBQGAIGONBFB6P7SUNO75GJEQ4Y7FVGMB2GPS3DJQVXSEP/transactions?cursor=179115963198013441&limit=10 HTTP/1.1" from 127.0.0.1:38834 - 200 199360B in 55.8414775s

After:

"GET http://localhost:8080/accounts/GBBJPGXTNNYBQGAIGONBFB6P7SUNO75GJEQ4Y7FVGMB2GPS3DJQVXSEP/transactions?cursor=179115963198013441&limit=10 HTTP/1.1" from 127.0.0.1:54188 - 200 199360B in 16.043121887s

(Note that the indices are local and no on-disk cache was used.)

Why

We can reduce latency by doing ledger downloads in parallel to processing. See #4468.

Known limitations

Obviously, neither of these represent acceptable levels of latency if we're talking about parity with classic Horizon. However, it's still a massive improvement over the previous code if we decide that high latency is acceptable for deeply historical requests.

exp/lighthorizon/services/main.go

Shaptic · 2022-08-02T20:56:00Z

exp/lighthorizon/services/main_test.go

+		On("GetLedger", mock.Anything, uint32(1586112)).Return(expectedLedger1, nil).
+		On("GetLedger", mock.Anything, uint32(1586113)).Return(expectedLedger2, nil).
+		On("GetLedger", mock.Anything, uint32(1586114)).Return(expectedLedger3, nil).
+		On("GetLedger", mock.Anything, mock.Anything).Return(xdr.LedgerCloseMeta{}, nil)


Does anyone know how to do a "any context.Context instance" equivalent here? I tried mock.AnythingOfType but that wasn't it.

what was the syntax, looks like mock.AnythingOfType("context.Context") shoulda worked?

sreuland · 2022-08-08T20:56:26Z

exp/lighthorizon/services/cursor.go

@@ -10,6 +10,7 @@ import (
 type CursorManager interface {
 	Begin(cursor int64) (int64, error)
 	Advance() (int64, error)
+	Skip(count uint) (int64, error)


not suggestion change, just a question on interface ergo, do you think Advance(skip uint) would be less moving parts for same new functionality?

interesting idea, I think it's pretty ergo. seeing Advance(0) becomes a little unintuitive, but I'll try it out

ah, subtle, rather than skip semantics, it could be Advance(increment uint) so Advance(0) would be no-op ?

sreuland · 2022-08-08T21:16:10Z

exp/lighthorizon/services/main.go

+// returned group have completed. In contrast, this function closes the output
+// channel when all work has been submitted (or the context errors).
+//
+// FIXME: Should this be a part of archive.Archive?


+1, to refactor this ledger loading optimization to be encapsulated in Archive instead, i.e. this caller method would largely be unchanged, still uses Archive.GetLedger(ctx, nextLedger), the new perf optimizations for loading get relo'd within there ?

I started working through refactoring this and realized that we actually probably don't need most of the Archive interface anymore after Bartek's work in #4488 which gives us a clean MetaArchive abstraction, which I think was a big point of what we were trying to accomplish with Archive.

I'll try to throw up a PR tomorrow to clean that up.

sreuland

looks really good, existing tests exercise this right? I think is worthwhile to follow your FIXME intuition now rather than wait, and refactor the logic into Archive.GetLedger , it may incur different unit test coverage..I think tests on this service mocked out Archive.

Shaptic · 2022-08-17T16:24:23Z

Abandoning in lieu of two separate PRs related to the comment about hiding this behind Archive.GetLedger().

Shaptic added the Ingestion Lite label Aug 2, 2022

Shaptic self-assigned this Aug 2, 2022

Shaptic changed the title ~~exp/lighthorizon: Lighthorizon parallel fetch~~ exp/lighthorizon/services: Fetch ledgers in parallel to processing. Aug 2, 2022

Shaptic changed the title ~~exp/lighthorizon/services: Fetch ledgers in parallel to processing.~~ exp/lighthorizon/services: Fetch ledgers in parallel to them being processed. Aug 2, 2022

Shaptic commented Aug 2, 2022

View reviewed changes

exp/lighthorizon/services/main.go Show resolved Hide resolved

Shaptic commented Aug 2, 2022

View reviewed changes

exp/lighthorizon/services/main.go Show resolved Hide resolved

Shaptic force-pushed the lighthorizon_parallelFetch branch from 44b71f7 to b779bf7 Compare August 2, 2022 20:54

Shaptic commented Aug 2, 2022

View reviewed changes

Shaptic requested review from sreuland, 2opremio and a team August 4, 2022 16:00

sreuland reviewed Aug 8, 2022

View reviewed changes

sreuland approved these changes Aug 8, 2022

View reviewed changes

Shaptic added 5 commits August 8, 2022 16:39

Add downloadLedgers() helper to facilitate parallel downloads

031f943

Handle context errors better

0e712ce

Move helpers below exported functions

a875d4a

Handle skip and unaligned cursors better

7bde96a

Prevent races and deadlocks

b733b48

Shaptic force-pushed the lighthorizon_parallelFetch branch from b779bf7 to b733b48 Compare August 8, 2022 23:46

Shaptic closed this Aug 17, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

exp/lighthorizon/services: Fetch ledgers in parallel to them being processed. #4500

exp/lighthorizon/services: Fetch ledgers in parallel to them being processed. #4500

Shaptic commented Aug 2, 2022 •

edited

Loading

Shaptic Aug 2, 2022

sreuland Aug 8, 2022

sreuland Aug 8, 2022

Shaptic Aug 9, 2022

sreuland Aug 9, 2022 •

edited

Loading

sreuland Aug 8, 2022

Shaptic Aug 9, 2022

sreuland left a comment

Shaptic commented Aug 17, 2022

exp/lighthorizon/services: Fetch ledgers in parallel to them being processed. #4500

exp/lighthorizon/services: Fetch ledgers in parallel to them being processed. #4500

Conversation

Shaptic commented Aug 2, 2022 • edited Loading

What

Why

Known limitations

Shaptic Aug 2, 2022

Choose a reason for hiding this comment

sreuland Aug 8, 2022

Choose a reason for hiding this comment

sreuland Aug 8, 2022

Choose a reason for hiding this comment

Shaptic Aug 9, 2022

Choose a reason for hiding this comment

sreuland Aug 9, 2022 • edited Loading

Choose a reason for hiding this comment

sreuland Aug 8, 2022

Choose a reason for hiding this comment

Shaptic Aug 9, 2022

Choose a reason for hiding this comment

sreuland left a comment

Choose a reason for hiding this comment

Shaptic commented Aug 17, 2022

Shaptic commented Aug 2, 2022 •

edited

Loading

sreuland Aug 9, 2022 •

edited

Loading