Batch segment retrieval from the metadata store #15305
Merged
abhishekrb19 merged 13 commits into apache:master from abhishekrb19:retrieve_used_segments_too_many_intervals_test on Nov 6, 2023
Conversation
abhishekrb19 changed the title from "Used segments retrieval fails when there are too many intervals" to "Batch segment retrieval from the metadata store" on Nov 3, 2023
zachjsh reviewed on Nov 3, 2023:
- server/src/main/java/org/apache/druid/metadata/SqlSegmentsMetadataQuery.java (resolved)
- server/src/test/java/org/apache/druid/metadata/IndexerSQLMetadataStorageCoordinatorTest.java (fixed)
zachjsh approved these changes on Nov 6, 2023:
LGTM
abhishekrb19 deleted the retrieve_used_segments_too_many_intervals_test branch on November 6, 2023 at 19:30
CaseyPan pushed a commit to CaseyPan/druid that referenced this pull request on Nov 17, 2023:
* Add a unit test that fails when used segments with too many intervals are retrieved. - This is a failing test case that needs to be ignored.
* Batch the intervals (use 100 as it's consistent with batching in other places).
* move the filtering inside the batch
* Account for limit cross the batch splits.
* Adjustments
* Fixup and add tests
* small refactor
* add more tests.
* remove wrapper.
* Minor edits
* assert out of range
Used segments retrieval fails when there are too many intervals (about 120 with Derby). The same failure can occur with multiple intervals for unused segments.
Previously, the number of intervals wasn't capped in the SQL query, which is further bloated by the grouped `OR` clause added per interval to handle the eternity interval. This PR splits the `SELECT` query into multiple batches, with 100 intervals per batch. This is similar to the existing batching strategy that caps the maximum number of segments announced at once.
Core changes:
A fix localized to kill tasks, which originally exposed this bug, is available in #15306. We'd still want this change in the server separately, since the issue can occur more broadly.
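The batching idea described above can be sketched as follows. This is a hypothetical, simplified illustration, not the actual Druid code: the `partition` helper and the constant name are placeholders standing in for whatever the real `SqlSegmentsMetadataQuery` does, and the real implementation additionally pushes filtering into each batch and tracks a result limit across batch boundaries.

```java
import java.util.ArrayList;
import java.util.List;

public class IntervalBatcher {
    // Cap on intervals per SQL query; the PR uses 100 for consistency
    // with batching elsewhere in the codebase.
    static final int MAX_INTERVALS_PER_BATCH = 100;

    // Split a large interval list into fixed-size batches so that each
    // generated SELECT carries at most MAX_INTERVALS_PER_BATCH grouped
    // OR clauses, instead of one unbounded query.
    static <T> List<List<T>> partition(List<T> items, int batchSize) {
        List<List<T>> batches = new ArrayList<>();
        for (int i = 0; i < items.size(); i += batchSize) {
            batches.add(items.subList(i, Math.min(i + batchSize, items.size())));
        }
        return batches;
    }

    public static void main(String[] args) {
        // 250 placeholder "intervals" yield 3 queries: 100 + 100 + 50.
        List<Integer> intervals = new ArrayList<>();
        for (int i = 0; i < 250; i++) {
            intervals.add(i);
        }
        List<List<Integer>> batches = partition(intervals, MAX_INTERVALS_PER_BATCH);
        System.out.println(batches.size());        // 3
        System.out.println(batches.get(2).size()); // 50
    }
}
```

Issuing one bounded query per batch and merging the results keeps each statement small enough for databases like Derby that fail on very large predicate lists.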
This PR has: