itest: add optional 10k asset test #325
Conversation
Looks good! I like the idea with the two test lists, probably a bit easier to understand (and also to implement) than the build tag variant.
Another motivating reason was that trying to use mutually exclusive build tags to swap out the test list caused some IDE issues with gopls, which could cause problems later on.
Updated with a minimal batch minting stress test, which reliably reproduces the first performance issue mentioned in #314. With a batchSize of 100, minting succeeds, and we can also reproduce #313. TODO list:
Force-pushed from 5fb7f89 to 9597bc0
@guggero: review reminder
Force-pushed from 8ccece5 to 40073b8
Very cool test, great to know we can mint that many assets in a decent time. And also great to have found some performance tweaks along the way with this!
I pushed up a couple of fixes and confirmed the 1k test succeeds locally on my machine both with SQLite and Postgres. The 10k test still takes too long to finish within the 60m test timeout, so I think we first need to get some of the optimizations in. So there are multiple fixes in here that address #361.
tapdb/sqlc/queries/universe.sql (Outdated)
FROM universe_leaves leaves
JOIN genesis_info_view gen
ON leaves.asset_genesis_id = gen.gen_asset_id
WHERE gen.asset_id = @asset_id
RIGHT OUTER JOIN key_group_info_view groups
So I think we can simplify this, and just target the group key directly.
Retracing a bit of history here, the query first looked like this: https://github.com/lightninglabs/taproot-assets/blame/c037025ebb22c54087736589bd31fb570a04b738/tapdb/sqlc/queries/universe.sql#L101-L105
The issue with that is that the asset_id field there, for assets with a group key, is basically whichever asset was inserted first. As a result, if you tried to log the sync of one of those leaves, the call could fail because it might match the wrong leaf.
We then changed it to target the leaf instead (since we know the asset ID of the leaf), and then use that to get the ID pointing back to the root.
However, if we have the group key, then we can just target that directly. So we can have a query that looks similar to the old one:
SELECT id
FROM universe_roots
WHERE group_key = @group_key
So with this, we don't need to modify that other view; we can just combine the queries and pick whichever one doesn't return NULL:
SELECT COALESCE(
(SELECT leaves.universe_root_id AS id
FROM universe_leaves leaves
JOIN genesis_info_view gen
ON leaves.asset_genesis_id = gen.gen_asset_id
WHERE gen.asset_id = @asset_id),
(SELECT id
FROM universe_roots
WHERE group_key = @group_key)
) AS id;
Thanks a lot! I think that did the trick (I went with a slightly different approach since I'm not sure how COALESCE behaves if a subquery returns no results or multiple results). But the idea is the same.
In this commit, we add additional make commands to run optional itests, where specific test cases are still specified with the 'icase' variable. Additional optional itests are added in the same way as default itests. We also remove the old-style build tags that were replaced with Go 1.17.
In this commit, we update the logic used in all itests to mint assets to reduce the number of RPC calls made and total time spent on asserts. We also separate waiting for the planter to change state and checking the daemon state.
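For illustration only, here is a minimal Go sketch of that wait/assert split, assuming lnd's lntest/wait package; the query and assert closures are hypothetical stand-ins for the actual itest helpers:

package itest

import (
	"fmt"
	"time"

	"github.com/lightningnetwork/lnd/lntest/wait"
)

// waitForBatchState polls only the cheap planter-state query until the
// batch reaches the wanted state (or the timeout expires).
func waitForBatchState(queryState func() (string, error), want string,
	timeout time.Duration) error {

	return wait.NoError(func() error {
		state, err := queryState()
		if err != nil {
			return err
		}
		if state != want {
			return fmt.Errorf("batch state %v, want %v", state, want)
		}
		return nil
	}, timeout)
}

// assertMintedBatch waits for the planter first, then runs the more
// expensive daemon-state assertions exactly once instead of on every poll.
func assertMintedBatch(queryState func() (string, error),
	assertDaemon func() error, want string) error {

	if err := waitForBatchState(queryState, want, 2*time.Minute); err != nil {
		return err
	}

	return assertDaemon()
}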
LGTM but not approving b/c original author.
Nice work guys!
Because the String() function just returns the hash of the ID, it is hard to debug whether both the asset ID and group key are set during universe sync. This commit adds a bit more information to certain log lines.
In this commit, we add a test that mints a large batch of collectibles with large metadata, and then checks that the batch mint succeeded. This includes correctly updating the universe server of the minting node, and syncing that universe tree to a second node.
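As a rough sketch of what such a test needs to set up (the request type and field names here are hypothetical, not the daemon's actual RPC types):

package itest

import (
	"bytes"
	"fmt"
)

// mintRequest is a hypothetical stand-in for the mint RPC request type.
type mintRequest struct {
	Name     string
	Metadata []byte
	Amount   uint64
}

// buildCollectibleBatch creates batchSize collectible requests, each
// carrying metaSize bytes of filler metadata (4 kB in the stress test).
func buildCollectibleBatch(batchSize, metaSize int) []*mintRequest {
	meta := bytes.Repeat([]byte("m"), metaSize)

	reqs := make([]*mintRequest, 0, batchSize)
	for i := 0; i < batchSize; i++ {
		reqs = append(reqs, &mintRequest{
			Name:     fmt.Sprintf("collectible-%d", i),
			Metadata: meta,
			Amount:   1, // collectibles are minted with a unit amount
		})
	}

	return reqs
}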
If SQLITE_BUSY is returned, it means waiting for a transaction failed because too many other readers/writers are currently using the DB. We can detect that error, convert it into a serialization error and detect that in the existing retry loop. We also re-try if creating or committing a transaction fails with a serialization error.
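A minimal sketch of that error mapping, assuming the mattn/go-sqlite3 driver (the repo's actual driver and error types may differ):

package tapdb

import (
	"errors"

	"github.com/mattn/go-sqlite3"
)

// errSerialization is a hypothetical sentinel that the existing retry loop
// is assumed to recognize as retryable.
var errSerialization = errors.New("serialization failure, retry transaction")

// mapSQLiteError converts SQLITE_BUSY into the same retryable error that a
// Postgres serialization failure is mapped to, so one retry loop handles
// both backends.
func mapSQLiteError(err error) error {
	var sqliteErr sqlite3.Error
	if errors.As(err, &sqliteErr) && sqliteErr.Code == sqlite3.ErrBusy {
		return errSerialization
	}

	return err
}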
// operations to prepare and validate the keys. But since we're now
// actually starting to write, we want to limit this to a single writer
// at a time.
b.registrationMtx.Lock()
I think we should revert this change. We want to make sure that our DB concurrency control can handle situations like this. For SQLite, there's only a single writer. For Postgres, the new serialization mode should detect conflicts, then abort so it can retry safely.
Without this change we ran into "number of retries exhausted" errors quite quickly, both on SQLite and Postgres. That's mainly because we constantly sync multiple leaves at a time (as many as there are CPUs, in parallel).
Gotcha, perhaps we need to increase either the backoff timeout or the number of attempts (I just kind of chose 10 randomly). We'll want to tune these params, as otherwise we may end up hitting this in other areas that have more involved DB transactions.
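To make the tuning concrete, here is a generic sketch of such a retry loop with the attempt count and backoff pulled out as constants; the names and values are placeholders, not the repo's actual settings:

package tapdb

import (
	"math/rand"
	"time"
)

// Hypothetical tuning knobs for the retry behavior discussed above.
const (
	defaultNumTxRetries = 10
	defaultRetryBackoff = 50 * time.Millisecond
)

// retryTx re-runs txFn until it succeeds, hits a non-retryable error, or
// exhausts the attempts. The backoff grows linearly with a little jitter so
// concurrent writers don't retry in lockstep.
func retryTx(txFn func() error, isRetryable func(error) bool) error {
	var err error
	for attempt := 0; attempt < defaultNumTxRetries; attempt++ {
		if err = txFn(); err == nil || !isRetryable(err) {
			return err
		}

		backoff := time.Duration(attempt+1) * defaultRetryBackoff
		jitter := time.Duration(rand.Intn(20)) * time.Millisecond
		time.Sleep(backoff + jitter)
	}

	return err
}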
Seems like batching these transactions would also work? E.g. committing batches of 5 issuance proofs vs. single proofs.
IIUC in all cases where we hit this issue we also don't need to write these proofs individually to recover from a restart.
Yeah, that'll work too. Right now we just do them one by one, but we can also batch the insertion. We can easily batch when adding to our own universe (that for loop in the planter), and then for RPC we can let it be a repeated field (see the sketch below).
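As a rough illustration of the batching idea (the table and column names are made up, and the real code would go through the repo's own transaction helpers), a single transaction can commit a whole slice of proofs at once:

package tapdb

import (
	"context"
	"database/sql"
)

// insertIssuanceProofs writes a whole batch of proofs in one transaction
// instead of one transaction per proof, so serializable transactions
// contend far less often.
func insertIssuanceProofs(ctx context.Context, db *sql.DB,
	proofs [][]byte) error {

	tx, err := db.BeginTx(ctx, nil)
	if err != nil {
		return err
	}
	defer func() { _ = tx.Rollback() }()

	// Postgres-style placeholder; SQLite would use "?" instead.
	stmt, err := tx.PrepareContext(
		ctx, "INSERT INTO universe_proofs (proof) VALUES ($1)",
	)
	if err != nil {
		return err
	}
	defer stmt.Close()

	for _, p := range proofs {
		if _, err := stmt.ExecContext(ctx, p); err != nil {
			return err
		}
	}

	return tx.Commit()
}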
Adds support for optional itests, starting with a stress test of the minter with a batch of 10,000 assets, each with 4kB of metadata.
Fixes #361.