refine decimal multiply, avoid cast to wider type #6331
Conversation
```scala
override def resultDecimalType(p1: Int, s1: Int, p2: Int, s2: Int): DecimalType = {
  val resultScale = s1 + s2
  val resultPrecision = p1 + p2 + 1
  if (allowPrecisionLoss) {
    DecimalType.adjustPrecisionScale(resultPrecision, resultScale)
  } else {
    DecimalType.bounded(resultPrecision, resultScale)
  }
}

private lazy val numeric = TypeUtils.getNumeric(dataType, failOnError)

protected override def nullSafeEval(input1: Any, input2: Any): Any = dataType match {
  case DecimalType.Fixed(precision, scale) =>
    checkDecimalOverflow(numeric.times(input1, input2).asInstanceOf[Decimal], precision, scale)
  case _: IntegerType if failOnError =>
    MathUtils.multiplyExact(
      input1.asInstanceOf[Int],
      input2.asInstanceOf[Int],
      getContextOrNull())
  case _: LongType if failOnError =>
    MathUtils.multiplyExact(
      input1.asInstanceOf[Long],
      input2.asInstanceOf[Long],
      getContextOrNull())
  case _ => numeric.times(input1, input2)
}
```
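The result-type rule in the snippet above can be sketched as a standalone function. This is a hypothetical illustration, not code from this PR; it assumes Decimal128 semantics (precision capped at 38), and it simplifies the precision-loss path (`adjustPrecisionScale`) to a plain cap:

```rust
// Hypothetical sketch of the decimal-multiply result-type rule quoted above.
// Assumes Decimal128 semantics: precision saturates at 38.
const MAX_PRECISION: u8 = 38;

fn multiply_result_type(p1: u8, s1: u8, p2: u8, s2: u8) -> (u8, u8) {
    // The exact product needs scale s1 + s2 and precision p1 + p2 + 1.
    let scale = s1 + s2;
    let precision = p1 + p2 + 1;
    // Bound precision at the maximum the physical type supports
    // (simplified: the real precision-loss mode may also reduce the scale).
    (precision.min(MAX_PRECISION), scale)
}

fn main() {
    // decimal(10, 2) * decimal(10, 2) -> decimal(21, 4)
    assert_eq!(multiply_result_type(10, 2, 10, 2), (21, 4));
    // Large inputs saturate at the maximum precision.
    assert_eq!(multiply_result_type(38, 10, 38, 10), (38, 20));
    println!("ok");
}
```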
```scala
override def checkInputDataTypes(): TypeCheckResult = (left.dataType, right.dataType) match {
  case (l: DecimalType, r: DecimalType) if inputType.acceptsType(l) && inputType.acceptsType(r) =>
    // We allow decimal type inputs with different precision and scale, and use special formulas
    // to calculate the result precision and scale.
    TypeCheckResult.TypeCheckSuccess
  case _ => super.checkInputDataTypes()
}
```
And the existing unit tests show the result is correct.
Good catch. This fixes the TPC-H q1 performance. But for the root cause, we still need to improve the i256 operations; otherwise, queries containing decimal array-to-array multiplication will still suffer from performance issues.

Yes, you are right. We still need to address the performance of the i256 operations; otherwise, whenever execution falls into that logic, performance will still be poor.
FYI, I'm going to improve the i256 operations upstream: apache/arrow-rs#4303.
Which issue does this PR close?
Closes #6278, bringing the performance back.
Rationale for this change
For decimal multiplication, avoid casting to a wider type: decimals can be multiplied directly.
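The idea can be sketched as follows. This is a hypothetical illustration, not the actual DataFusion kernel: it multiplies two Decimal128 values on their `i128` unscaled representations directly, instead of first widening both operands to i256:

```rust
// Sketch: multiply Decimal128 values on the i128 unscaled representations
// directly, rather than widening to i256 first.
fn decimal_mul_direct(a: i128, b: i128) -> Option<i128> {
    // The unscaled product already has scale s1 + s2, so no rescale is
    // needed here; checked_mul reports i128 overflow, where a wider
    // fallback (or an error) would still be required.
    a.checked_mul(b)
}

fn main() {
    // 1.23 (scale 2) * 4.5 (scale 1) = 5.535 (scale 3): 123 * 45 = 5535
    assert_eq!(decimal_mul_direct(123, 45), Some(5535));
    // Overflow is detected rather than silently wrapping.
    assert_eq!(decimal_mul_direct(i128::MAX, 2), None);
    println!("ok");
}
```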
What changes are included in this PR?
Are these changes tested?
Yes. The existing unit tests cover correctness, and the benchmark results are below.

Main branch:
```
Running benchmarks with the following options: DataFusionBenchmarkOpt { query: Some(1), debug: false, iterations: 3, partitions: 1, batch_size: 8192, path: "./parquet_data", file_format: "parquet", mem_table: false, output_path: None, disable_statistics: true }
Query 1 iteration 0 took 1716.3 ms and returned 4 rows
Query 1 iteration 1 took 1697.0 ms and returned 4 rows
Query 1 iteration 2 took 1694.3 ms and returned 4 rows
Query 1 avg time: 1702.52 ms
```
Branch 23 (Tag 23.0.0):
```
Running benchmarks with the following options: DataFusionBenchmarkOpt { query: Some(1), debug: false, iterations: 3, partitions: 1, batch_size: 8192, path: "./parquet_data", file_format: "parquet", mem_table: false, output_path: None, disable_statistics: true, enable_scheduler: false }
Query 1 iteration 0 took 864.2 ms and returned 4 rows
Query 1 iteration 1 took 842.0 ms and returned 4 rows
Query 1 iteration 2 took 838.7 ms and returned 4 rows
Query 1 avg time: 848.29 ms
```
This PR:
```
Running benchmarks with the following options: DataFusionBenchmarkOpt { query: Some(1), debug: false, iterations: 3, partitions: 1, batch_size: 8192, path: "./parquet_data", file_format: "parquet", mem_table: false, output_path: None, disable_statistics: true }
Query 1 iteration 0 took 539.1 ms and returned 4 rows
Query 1 iteration 1 took 514.7 ms and returned 4 rows
Query 1 iteration 2 took 511.1 ms and returned 4 rows
Query 1 avg time: 521.61 ms
```
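Summarizing the Query 1 averages above (a throwaway helper for the arithmetic, not part of the PR):

```rust
// Speedup of this PR's Query 1 average relative to the other branches,
// using the averages reported above.
fn speedup(baseline_ms: f64, new_ms: f64) -> f64 {
    baseline_ms / new_ms
}

fn main() {
    let (main_avg, tag23_avg, pr_avg) = (1702.52, 848.29, 521.61);
    // Roughly 3.3x faster than main and 1.6x faster than 23.0.0.
    println!("vs main:   {:.2}x", speedup(main_avg, pr_avg));
    println!("vs 23.0.0: {:.2}x", speedup(tag23_avg, pr_avg));
}
```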
Are there any user-facing changes?