expression vector processing improvements #17561

clintropolis · 2024-12-12T13:10:07Z

Description

changes:

introduces FilteredInputBinding which adds better conditional expression processing support using a VectorMatch internally to selectively evaluate input vectors instead of precomputing all inputs, with nvl updated to take advantage of this
refactor some stuff to streamline expression vector processor implementation for simple functions like most math and logical operations with some new factory classes
update vector identifier expression processor to delegate evaluating results directly to the input binding selectors with ExprEvalBindingVector
add maxVectorSize() to ExprVectorProcessor to avoid having to pass max vector size around everywhere

some benchmarks with nvl before and after:

SELECT NVL(string2, CONCAT(string1, '-', long2)), SUM(double1) FROM expressions GROUP BY 1 ORDER BY 2
SELECT NVL(string1, CONCAT(string3, '-', long2)), SUM(double1) FROM expressions GROUP BY 1 ORDER BY 2
SELECT NVL(long1, long5 + long3), SUM(double1) FROM expressions GROUP BY 1 ORDER BY 2

before:

Benchmark                        (complexCompression)  (deferExpressionDimensions)  (query)  (rowsPerSegment)  (schemaType)  (storageType)  (stringEncoding)  (vectorize)  Mode  Cnt    Score    Error  Units
SqlExpressionBenchmark.querySql                  none                 singleString       49           1500000      explicit           MMAP              UTF8        force  avgt    5  258.528 ±  2.046  ms/op
SqlExpressionBenchmark.querySql                  none                 singleString       53           1500000      explicit           MMAP              UTF8        force  avgt    5  275.814 ±  1.727  ms/op
SqlExpressionBenchmark.querySql                  none                 singleString       57           1500000      explicit           MMAP              UTF8        force  avgt    5   68.072 ±  1.334  ms/op
SqlExpressionBenchmark.querySql                  none                   fixedWidth       49           1500000      explicit           MMAP              UTF8        force  avgt    5  475.695 ±  5.691  ms/op
SqlExpressionBenchmark.querySql                  none                   fixedWidth       53           1500000      explicit           MMAP              UTF8        force  avgt    5  476.026 ± 19.507  ms/op
SqlExpressionBenchmark.querySql                  none                   fixedWidth       57           1500000      explicit           MMAP              UTF8        force  avgt    5  479.159 ±  6.044  ms/op
SqlExpressionBenchmark.querySql                  none         fixedWidthNonNumeric       49           1500000      explicit           MMAP              UTF8        force  avgt    5  477.816 ±  6.072  ms/op
SqlExpressionBenchmark.querySql                  none         fixedWidthNonNumeric       53           1500000      explicit           MMAP              UTF8        force  avgt    5  470.072 ± 14.624  ms/op
SqlExpressionBenchmark.querySql                  none         fixedWidthNonNumeric       57           1500000      explicit           MMAP              UTF8        force  avgt    5   69.851 ±  1.485  ms/op
SqlExpressionBenchmark.querySql                  none                       always       49           1500000      explicit           MMAP              UTF8        force  avgt    5  477.870 ±  3.244  ms/op
SqlExpressionBenchmark.querySql                  none                       always       53           1500000      explicit           MMAP              UTF8        force  avgt    5  474.052 ± 15.498  ms/op
SqlExpressionBenchmark.querySql                  none                       always       57           1500000      explicit           MMAP              UTF8        force  avgt    5  471.010 ±  3.207  ms/op

after:

Benchmark                        (complexCompression)  (deferExpressionDimensions)  (query)  (rowsPerSegment)  (schemaType)  (storageType)  (stringEncoding)  (vectorize)  Mode  Cnt    Score    Error  Units
SqlExpressionBenchmark.querySql                  none                 singleString       49           1500000      explicit           MMAP              UTF8        force  avgt    5  204.239 ±  2.762  ms/op
SqlExpressionBenchmark.querySql                  none                 singleString       53           1500000      explicit           MMAP              UTF8        force  avgt    5  216.547 ±  1.920  ms/op
SqlExpressionBenchmark.querySql                  none                 singleString       57           1500000      explicit           MMAP              UTF8        force  avgt    5   38.502 ±  0.968  ms/op
SqlExpressionBenchmark.querySql                  none                   fixedWidth       49           1500000      explicit           MMAP              UTF8        force  avgt    5  480.898 ±  8.471  ms/op
SqlExpressionBenchmark.querySql                  none                   fixedWidth       53           1500000      explicit           MMAP              UTF8        force  avgt    5  457.998 ±  5.167  ms/op
SqlExpressionBenchmark.querySql                  none                   fixedWidth       57           1500000      explicit           MMAP              UTF8        force  avgt    5  476.647 ±  4.343  ms/op
SqlExpressionBenchmark.querySql                  none         fixedWidthNonNumeric       49           1500000      explicit           MMAP              UTF8        force  avgt    5  476.482 ±  3.979  ms/op
SqlExpressionBenchmark.querySql                  none         fixedWidthNonNumeric       53           1500000      explicit           MMAP              UTF8        force  avgt    5  457.149 ±  9.338  ms/op
SqlExpressionBenchmark.querySql                  none         fixedWidthNonNumeric       57           1500000      explicit           MMAP              UTF8        force  avgt    5   38.647 ±  0.943  ms/op
SqlExpressionBenchmark.querySql                  none                       always       49           1500000      explicit           MMAP              UTF8        force  avgt    5  478.346 ±  5.641  ms/op
SqlExpressionBenchmark.querySql                  none                       always       53           1500000      explicit           MMAP              UTF8        force  avgt    5  477.180 ± 14.496  ms/op
SqlExpressionBenchmark.querySql                  none                       always       57           1500000      explicit           MMAP              UTF8        force  avgt    5  478.263 ±  3.889  ms/op

note that this does seem to be a case where deferred expression processing is worse than normal vector processing, though not sure if/how we should tweak the strategies at the moment.

changes: * introduces FilteredInputBinding which adds better conditional expression processing support using a VectorMatch internally to selectively evaluate input vectors instead of precomputing all inputs, with nvl updated to take advantage of this * refactor some stuff to streamline expression vector processor implementation for simple functions like most math and logical operations with some new factory classes * update vector identifier expression processor to delegate evaluating results directly to the input binding selectors with ExprEvalBindingVector * add maxVectorSize() to ExprVectorProcessor to avoid having to pass max vector size around everywhere

processing/src/test/java/org/apache/druid/math/expr/vector/FilteredVectorInputBindingTest.java

+    matchMaker.setSelectionSize(3);
+
+    double[] doubles = filteredVectorInputBinding.getDoubleVector("double");
+    boolean[] nulls = filteredVectorInputBinding.getNullVector("double");


…onal-expressions

gianm · 2025-01-29T19:56:49Z

note that this does seem to be a case where deferred expression processing is worse than normal vector processing, though not sure if/how we should tweak the strategies at the moment.

Seems like NVL(long1, long5 + long3) slows down a lot with deferral. Did you profile it? I wonder if the reason is because regular processing is lazy (doesn't need to read long5, long3, doesn't need to compute long5 + long3)? Perhaps the tweak could be that by default we don't defer short-circuitable exprs, such as nvl, &&, etc.

changes: * introduces FilteredInputBinding which adds better conditional expression processing support using a VectorMatch internally to selectively evaluate input vectors instead of precomputing all inputs, with nvl updated to take advantage of this * refactor some stuff to streamline expression vector processor implementation for simple functions like most math and logical operations with some new factory classes * update vector identifier expression processor to delegate evaluating results directly to the input binding selectors with ExprEvalBindingVector * add maxVectorSize() to ExprVectorProcessor to avoid having to pass max vector size around everywhere

clintropolis added Performance Area - Querying labels Dec 12, 2024

github-actions bot added the Area - Segment Format and Ser/De label Dec 12, 2024

github-advanced-security bot found potential problems Dec 12, 2024

View reviewed changes

Merge remote-tracking branch 'upstream/master' into vectorize-conditi…

90282a1

…onal-expressions

gianm approved these changes Jan 29, 2025

View reviewed changes

gianm merged commit c079fa0 into apache:master Jan 29, 2025
79 checks passed

clintropolis deleted the vectorize-conditional-expressions branch January 29, 2025 21:38

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

expression vector processing improvements #17561

expression vector processing improvements #17561

clintropolis commented Dec 12, 2024 •

edited

Loading

gianm commented Jan 29, 2025 •

edited

Loading

expression vector processing improvements #17561

expression vector processing improvements #17561

Conversation

clintropolis commented Dec 12, 2024 • edited Loading

Description

gianm commented Jan 29, 2025 • edited Loading

clintropolis commented Dec 12, 2024 •

edited

Loading

gianm commented Jan 29, 2025 •

edited

Loading