use mmap for nested column value to dictionary id lookup for more chill heap usage during serialization #14919
Conversation
processing/src/main/java/org/apache/druid/segment/nested/DictionaryIdLookup.java
```java
        Assert.assertEquals(" index: " + i, LONGS[i - 1], writer.get(i));
      }
    } else {
      Assert.assertEquals(" index: " + i, LONGS[i], writer.get(i));
```
Check failure (Code scanning / CodeQL): Array index out of bounds
processing/src/main/java/org/apache/druid/segment/nested/NestedDataColumnSerializer.java
processing/src/main/java/org/apache/druid/segment/nested/VariantColumnSerializer.java
```java
    smoosher.close();
    stringBufferMapper = SmooshedFileMapper.load(stringSmoosh);
    final ByteBuffer stringBuffer = stringBufferMapper.mapFile(fileName);
    stringDictionary = StringEncodingStrategies.getStringDictionarySupplier(
```
Do we handle string dictionaries with encodings other than UTF-8?
Not sure what the question here is. We currently support two encoding strategies for string dictionaries: plain UTF-8, and front-coded, which also stores UTF-8 strings, just incrementally encoded. https://github.com/apache/druid/blob/master/processing/src/main/java/org/apache/druid/segment/column/StringEncodingStrategy.java#L40
Which strategy is used is controlled via the IndexSpec: https://github.com/apache/druid/blob/master/processing/src/main/java/org/apache/druid/segment/IndexSpec.java#L84
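For readers unfamiliar with front coding: each sorted value is stored as the length of the prefix it shares with the previous value plus the remaining UTF-8 suffix. A minimal, generic sketch of the idea (illustration only, not Druid's actual FrontCodedIndexedWriter):

```java
import java.nio.charset.StandardCharsets;
import java.util.List;

public class FrontCodingSketch
{
  // Encode sorted values as (sharedPrefixLength, suffixBytes) pairs.
  public static void main(String[] args)
  {
    List<String> sorted = List.of("druid", "druidism", "drum", "drummer");
    String previous = "";
    for (String value : sorted) {
      int shared = commonPrefixLength(previous, value);
      byte[] suffix = value.substring(shared).getBytes(StandardCharsets.UTF_8);
      System.out.println(shared + " + \"" + new String(suffix, StandardCharsets.UTF_8) + "\"");
      previous = value;
    }
  }

  private static int commonPrefixLength(String a, String b)
  {
    int limit = Math.min(a.length(), b.length());
    int i = 0;
    while (i < limit && a.charAt(i) == b.charAt(i)) {
      i++;
    }
    return i;
  }
}
```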
```java
    doubleLookup.defaultReturnValue(-1);
    this.arrayLookup = new Object2IntAVLTreeMap<>(FrontCodedIntArrayIndexedWriter.ARRAY_COMPARATOR);
    this.arrayLookup.defaultReturnValue(-1);
    if (stringDictionary == null) {
```
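For context on the hunk above: the fastutil maps signal a missing key through defaultReturnValue, so a lookup miss comes back as -1 rather than throwing or returning null. A tiny generic sketch of that pattern (plain fastutil usage, not the Druid class):

```java
import it.unimi.dsi.fastutil.objects.Object2IntAVLTreeMap;

public class DefaultReturnValueSketch
{
  public static void main(String[] args)
  {
    // Heap-resident value -> dictionary id lookup; the PR moves the string/long/double
    // dictionaries out of structures like this and onto mmapped files.
    Object2IntAVLTreeMap<String> lookup = new Object2IntAVLTreeMap<>();
    lookup.defaultReturnValue(-1);       // returned by getInt() when the key is absent
    lookup.put("apple", 0);
    lookup.put("banana", 1);

    System.out.println(lookup.getInt("banana"));  // 1
    System.out.println(lookup.getInt("cherry"));  // -1
  }
}
```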
It would be a great idea to emit metrics on usage of the file smoosh so we can monitor dictionaries larger than 2GB; not sure if we can track that today.
What metric do you have in mind? Also, why 2GB instead of just a metric on the size of the components? And shouldn't such a metric really be much broader in scope and not just apply to nested columns? This doesn't feel like either a blocker or something that should be resolved in this PR, but I agree it does sound potentially nice to have a better breakdown of what is driving segment size.
Tangentially related, I'd still like to do #7124 someday to have another way to look inside segments and see what is driving their size.
We could emit a metric whenever we need to use the file smoosh for a large dictionary; that would give us some warning signs that a dictionary is getting bigger.
```java
    // for strings because of this. if other type dictionary writers could potentially use multiple internal files
    // in the future, we should transition them to using this approach as well (or build a combination smoosher and
    // mapper so that we can have a mutable smoosh)
    File stringSmoosh = FileUtils.createTempDir(name + "__stringTempSmoosh");
```
Can this dictionary name conflict with other dictionaries if we have multiple instances running on the same machine?
Behind FileUtils.createTempDir, this is using Files.createTempDirectory, which uses java.io.tmpdir. We use this in a lot of places when building segment files, so it should be safe.
To add more details: for a column named "nested", the temp dir looks something like /var/folders/8y/mhfmxp391pl9m2h103s_kn200000gn/T/nested__stringTempSmoosh14881555230535603507. The same is true for the other temp files I'm generating that use Files.createTempFile directly, e.g. /var/folders/8y/mhfmxp391pl9m2h103s_kn200000gn/T/nested__longDictionary8646593343499944990.tmp
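For reference, FileUtils.createTempDir and Files.createTempFile both append a random suffix under java.io.tmpdir, which is why concurrent instances on the same machine get distinct paths. A small JDK-only illustration (the prefixes are just examples):

```java
import java.io.IOException;
import java.nio.file.Files;
import java.nio.file.Path;

public class TempPathSketch
{
  public static void main(String[] args) throws IOException
  {
    // Each call gets a unique random suffix, e.g. nested__stringTempSmoosh14881555230535603507
    Path smooshDir = Files.createTempDirectory("nested__stringTempSmoosh");
    Path longDictionary = Files.createTempFile("nested__longDictionary", ".tmp");
    System.out.println(smooshDir);
    System.out.println(longDictionary);
  }
}
```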
Looks good to me.
apache#15068: Fixes a bug caused by apache#14919, which was just using the column name as part of a temp file name, which isn't very cool, my bad. Switched to StringUtils.urlEncode so that ugly characters don't break things. The modified test fails without the changes in this PR.
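To illustrate the follow-up fix: a column name containing path separators or other special characters would otherwise land verbatim in the temp file prefix. A sketch using the JDK's URLEncoder as a stand-in for Druid's StringUtils.urlEncode (the column name here is made up):

```java
import java.io.IOException;
import java.net.URLEncoder;
import java.nio.charset.StandardCharsets;
import java.nio.file.Files;
import java.nio.file.Path;

public class SafeTempNameSketch
{
  public static void main(String[] args) throws IOException
  {
    String columnName = "nested/col:v2";   // hypothetical name with characters unsafe for file names
    String safePrefix = URLEncoder.encode(columnName, StandardCharsets.UTF_8);
    Path tempDir = Files.createTempDirectory(safePrefix + "__stringTempSmoosh");
    System.out.println(tempDir);           // e.g. .../nested%2Fcol%3Av2__stringTempSmoosh1234567890
  }
}
```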
Description
Switches DictionaryIdLookup to serialize the value dictionaries and then use mmap to load them into their respective Indexed implementations, instead of the {x}2IntMap heap collections we were previously using, resulting in dramatically more predictable heap usage at very minor performance cost.
Before (the peaks are merge/publish, 10m rows, 5m max segment size):
After (the dips are merge/publish with the peaks at the end being bitmap index creation):
It does add a bit of extra time to the task I used in my experiment:
but it seems worth the cost, and in practice it is probably faster than the heap-based approach when the task has a lot less room to spare and faces more expensive GC cycles. I will be looking for other areas to improve in exchange for this change.
I also refactored a few things to clean some stuff up (mainly consolidating code for reading string dictionaries that might be either front-coded or classic GenericIndexed-based, which was duplicated in a bunch of places).
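As a rough illustration of the general technique (plain JDK NIO, not the actual DictionaryIdLookup code): serialize the sorted value dictionary to a temp file, mmap it, and binary search the mapped buffer for value-to-id lookups, so the dictionary no longer has to live in a heap map during serialization.

```java
import java.io.IOException;
import java.nio.ByteBuffer;
import java.nio.LongBuffer;
import java.nio.channels.FileChannel;
import java.nio.file.Files;
import java.nio.file.Path;
import java.nio.file.StandardOpenOption;

public class MmapDictionarySketch
{
  public static void main(String[] args) throws IOException
  {
    long[] sortedValues = {-10L, 0L, 7L, 42L, 1000L};

    // Serialize the sorted dictionary to a temp file instead of keeping a heap map.
    Path dictFile = Files.createTempFile("example__longDictionary", ".tmp");
    ByteBuffer scratch = ByteBuffer.allocate(sortedValues.length * Long.BYTES);
    for (long v : sortedValues) {
      scratch.putLong(v);
    }
    scratch.flip();
    try (FileChannel channel = FileChannel.open(dictFile, StandardOpenOption.WRITE)) {
      channel.write(scratch);
    }

    // mmap the file and binary search it to map a value back to its dictionary id.
    try (FileChannel channel = FileChannel.open(dictFile, StandardOpenOption.READ)) {
      LongBuffer dictionary = channel.map(FileChannel.MapMode.READ_ONLY, 0, channel.size()).asLongBuffer();
      System.out.println(lookupId(dictionary, 42L));  // 3
      System.out.println(lookupId(dictionary, 5L));   // -1: not in the dictionary
    }
  }

  // Standard binary search over the mapped buffer; -1 means the value is absent.
  private static int lookupId(LongBuffer dictionary, long value)
  {
    int low = 0;
    int high = dictionary.limit() - 1;
    while (low <= high) {
      int mid = (low + high) >>> 1;
      long midVal = dictionary.get(mid);
      if (midVal < value) {
        low = mid + 1;
      } else if (midVal > value) {
        high = mid - 1;
      } else {
        return mid;
      }
    }
    return -1;
  }
}
```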
This PR has: