
Various fixes for large columns. #17691

Merged 6 commits into apache:master on Feb 3, 2025
Conversation

@gianm (Contributor) commented Jan 31, 2025

This patch fixes a class of bugs where various primitive column readers were not providing a SmooshedFileMapper to GenericIndexed, even though the corresponding writer could potentially write multi-file columns. For example, #7943 is an instance of this bug.

This patch also includes a fix for an issue on the writer for compressed multi-value string columns, V3CompressedVSizeColumnarMultiIntsSerializer, where it would use the same base filename for both the offset and values sections. This bug would only be triggered for segments in excess of 500 million rows. When a segment has fewer rows than that, it could potentially have a values section that needs to be split over multiple files, but the offset is never more than 4 bytes per row. This bug was triggered by the new tests, which use a smaller fileSizeLimit.

This patch also removes the 2-arg overload of GenericIndexed#read (which doesn't accept a SmooshedFileMapper). The 3-arg one can be used with the SmooshedFileMapper arg set to null if the caller really doesn't want to use a mapper. However, production callers generally do.
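The reader-side contract described above can be illustrated with a simplified, hypothetical sketch (class and method names like `FileMapper` and `MultiFileColumn` are stand-ins for illustration, not the actual Druid API): a column whose values section was split across secondary smoosh files can only be reassembled if the reader is handed the mapper that knows where those files live, which is exactly what the buggy call sites failed to do.

```java
import java.util.HashMap;
import java.util.Map;

// Simplified stand-in for SmooshedFileMapper: resolves secondary file names.
class FileMapper {
    private final Map<String, String> files = new HashMap<>();

    void put(String name, String contents) { files.put(name, contents); }

    String map(String name) {
        String contents = files.get(name);
        if (contents == null) {
            throw new IllegalStateException("missing secondary file: " + name);
        }
        return contents;
    }
}

// Simplified stand-in for reading a column that may span multiple files.
// (A real reader would fall back to the primary buffer for single-file columns.)
class MultiFileColumn {
    static String read(String baseName, int numParts, FileMapper mapper) {
        if (mapper == null) {
            // This is the bug class fixed by the patch: a null mapper makes
            // multi-file columns unreadable even though the writer wrote them.
            throw new IllegalStateException("multi-file column but no mapper provided");
        }
        StringBuilder sb = new StringBuilder();
        for (int i = 0; i < numParts; i++) {
            sb.append(mapper.map(baseName + "_value_" + i));
        }
        return sb.toString();
    }
}

public class MapperSketch {
    public static void main(String[] args) {
        FileMapper mapper = new FileMapper();
        mapper.put("col_value_0", "abc");
        mapper.put("col_value_1", "def");
        System.out.println(MultiFileColumn.read("col", 2, mapper)); // prints abcdef
    }
}
```

Removing the 2-arg overload makes the "no mapper" case an explicit `null` at each call site, so omissions like this are visible in review rather than silently defaulted.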

@clintropolis (Member) left a comment:

👍

@@ -66,7 +67,8 @@ public static CompressedBigDecimalLongColumnSerializer create(
        segmentWriteOutMedium,
        String.format(Locale.ROOT, "%s.magnitude", filenameBase),
        Integer.MAX_VALUE,
-       CompressionStrategy.LZ4
+       CompressionStrategy.LZ4,
+       GenericIndexedWriter.MAX_FILE_SIZE
clintropolis (Member):
is there any reason to call this in non-test code with any value other than GenericIndexedWriter.MAX_FILE_SIZE? like wondering if we should make a version of the creators that automatically passes this argument in and mark the one that takes the size argument as for tests?

gianm (Contributor, Author):

I thought about this, but this seemed fine enough, and it keeps the GenericIndexedWriter API less cluttered. Having two overloads of the method with slightly different arguments seemed like it might be confusing.

@clintropolis (Member) commented Jan 31, 2025:

Yeah, I guess new callers of these methods aren't very frequent, so maybe it's OK. I was just afraid it seems more confusing for the production code to accept an argument for which there is basically one reasonable value to ever pass in; any new callers would hopefully check what the other callers are doing and pass in the same constant. Maybe we should add javadocs on these create methods to indicate that most callers should specify GenericIndexedWriter.MAX_FILE_SIZE as the argument, except for testing?

gianm (Contributor, Author):

I added a javadoc to the GenericIndexedWriter#ofCompressedByteBuffers and V3CompressedVSizeColumnarMultiIntsSerializer#create methods. There are some other methods that now accept fileSizeLimit, but they are package-private so are really only used by tests and by closely-related code. I didn't add javadocs to those since it didn't seem necessary.

@@ -72,7 +79,7 @@

  private static <T> GenericIndexed<T> read(ByteBuffer buf, ComplexMetricSerde serde)
  {
-   return GenericIndexed.read(buf, serde.getObjectStrategy());
+   return GenericIndexed.read(buf, serde.getObjectStrategy(), null);

Check notice (Code scanning / CodeQL): Deprecated method or constructor invocation. Invoking ComplexMetricSerde.getObjectStrategy should be avoided because it has been deprecated.
  )
  {
    return new V3CompressedVSizeColumnarMultiIntsSerializer(
        columnName,
        new CompressedColumnarIntsSerializer(
            columnName,
            segmentWriteOutMedium,
-           filenameBase,
+           filenameBase + ".offsets",
A reviewer (Contributor) asked:

Could you please explain why the ".offsets" suffix needs to be added? Is it a QOL improvement not directly related to this patch?
How does backward compatibility look after this change, i.e. can older segment-reading code still read these files?

gianm (Contributor, Author):

This was fixing another bug, mentioned in the PR description:

This patch also includes a fix for an issue on the writer for compressed multi-value string columns, V3CompressedVSizeColumnarMultiIntsSerializer, where it would use the same base filename for both the offset and values sections. This bug would only be triggered for segments in excess of 500 million rows. When a segment has fewer rows than that, it could potentially have a values section that needs to be split over multiple files, but the offset is never more than 4 bytes per row. This bug was triggered by the new tests, which use a smaller fileSizeLimit.

Backward compatibility would be OK, because the secondary filenames are written into the primary file. They aren't hard-coded on the reader side, so older readers will still be able to read these differently-named files.
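The filename collision can be sketched with a hypothetical part-naming helper (the real GenericIndexedWriter naming scheme may differ; `partNames` is invented for illustration): when the offsets and values sections derive secondary part files from the same base filename, both sections try to write the same files, and only a distinct base (here `".offsets"`) keeps them apart.

```java
import java.util.ArrayList;
import java.util.List;

public class FilenameSketch {
    // Hypothetical naming scheme: each section writes numbered part files
    // derived from its base filename.
    static List<String> partNames(String base, int numParts) {
        List<String> names = new ArrayList<>();
        for (int i = 0; i < numParts; i++) {
            names.add(base + "_part_" + i);
        }
        return names;
    }

    public static void main(String[] args) {
        // Before the fix: both sections derive names from the same base,
        // so their secondary part files collide.
        List<String> offsets = partNames("col", 2);
        List<String> values = partNames("col", 2);
        System.out.println(offsets.equals(values)); // prints true: collision

        // After the fix: the offsets section gets a distinct base
        // ("col.offsets"), so the two sections never overwrite each other.
        List<String> fixedOffsets = partNames("col.offsets", 2);
        System.out.println(fixedOffsets.equals(values)); // prints false
    }
}
```

Since the primary file records the actual secondary filenames, readers discover them dynamically, which is why the rename is backward compatible.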

      final Closer closer
  )
  {
    GenericIndexedWriter<ByteBuffer> writer = new GenericIndexedWriter<>(
        segmentWriteOutMedium,
        filenameBase,
-       compressedByteBuffersWriteObjectStrategy(compressionStrategy, bufferSize, closer)
+       compressedByteBuffersWriteObjectStrategy(compressionStrategy, bufferSize, closer),
+       fileSizeLimit
A reviewer (Contributor) asked:

@gianm It looks like this is the crux of the change?

gianm (Contributor, Author):

The crux of the fix was passing in a SmooshedFileMapper to GenericIndexed#read in various call sites. The new fileSizeLimit parameter is really just there for testing.
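The testing motivation comes down to simple arithmetic: with the production limit (Integer.MAX_VALUE bytes per file), forcing a multi-file split would require writing roughly 2 GB per column — at 4 bytes per offset row, that is about 536 million rows, matching the "in excess of 500 million rows" figure above — while a small fileSizeLimit triggers the split after a few bytes. A minimal sketch (`partCount` is a hypothetical helper, not Druid code):

```java
public class SplitSketch {
    // Number of secondary files needed to hold totalBytes under a per-file limit.
    static long partCount(long totalBytes, long fileSizeLimit) {
        return (totalBytes + fileSizeLimit - 1) / fileSizeLimit; // ceiling division
    }

    public static void main(String[] args) {
        // Production limit: a 10 MB column fits in one file, so the
        // multi-file code path is never exercised.
        System.out.println(partCount(10_000_000L, Integer.MAX_VALUE)); // prints 1

        // Test-sized limit: the same 10 MB column splits into many files,
        // exercising the SmooshedFileMapper reader path cheaply.
        System.out.println(partCount(10_000_000L, 1_000_000L)); // prints 10
    }
}
```

This is why the new fileSizeLimit parameter exists even though production callers always pass GenericIndexedWriter.MAX_FILE_SIZE.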

@gianm merged commit 0be9815 into apache:master on Feb 3, 2025
79 checks passed
@gianm deleted the fix-large-columns branch on February 3, 2025 at 18:12
317brian pushed a commit to 317brian/druid that referenced this pull request Feb 3, 2025
* Various fixes for large columns.


* Use a Random seed.

* Remove erroneous test code.

* Fix two compilation problems.

* Add javadocs.

* Another javadoc.