
fix: decimal conversion looses value on lower precision #6836

Merged
merged 3 commits into apache:main from fix_decimal_conversion_bug on Dec 12, 2024

Conversation

himadripal
Contributor

@himadripal himadripal commented Dec 4, 2024

Which issue does this PR close?

Closes #6833


Rationale for this change

Casting Decimal128 to Decimal128 with smaller precision produces incorrect results in some cases.

What changes are included in this PR?

This PR adds a decimal precision validation step after conversion, checking whether the converted value fits into the specified precision and scale.
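
For illustration, a minimal hedged sketch of that idea (not the exact kernel code from the PR; the helper name and structure are made up):

use arrow_array::types::{Decimal128Type, DecimalType};
use arrow_schema::ArrowError;

// Hypothetical helper: after rescaling, confirm the unscaled value still fits
// in the target precision instead of silently truncating it.
fn checked_rescale(rescaled: i128, output_precision: u8) -> Result<i128, ArrowError> {
    // `validate_decimal_precision` errors if `rescaled` needs more digits than
    // `output_precision` allows.
    Decimal128Type::validate_decimal_precision(rescaled, output_precision)?;
    Ok(rescaled)
}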

Are there any user-facing changes?

@github-actions github-actions bot added the arrow (Changes to the arrow crate) label on Dec 4, 2024
@himadripal
Contributor Author

@andygrove @viirya @alamb please take a look.

@@ -112,8 +112,19 @@ where
};

Ok(match cast_options.safe {
Member

It seems strange to match on a boolean rather than just using an if statement. I know this is how the existing code was, but perhaps we could improve this while we are here.
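
For illustration, a tiny self-contained sketch of the suggested style (the bodies are placeholders, not the real cast paths):

// Branch with `if` instead of matching on a bool; the two arms correspond to
// the cast's NULL-on-overflow vs. error-on-overflow behaviour.
fn overflow_behaviour(safe: bool) -> &'static str {
    if safe {
        "overflowing values become NULL"
    } else {
        "overflowing values produce an error"
    }
}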

@himadripal himadripal changed the title decimal conversion looses value on lower precision, throws error now … fix: decimal conversion looses value on lower precision Dec 4, 2024
Member

@andygrove andygrove left a comment

LGTM. The logic now matches the logic in cast_floating_point_to_decimal128. Thanks @himadripal

let result = cast_with_options(&array, &output_type, &options);

assert_eq!(result.unwrap_err().to_string(),
"InvalidArgumentError(123456789 is too large to store in a Decimal128 of precision 6. Max is 999999)");
Member

The error message format needs updating:

  left: "Invalid argument error: 123456789 is too large to store in a Decimal256 of precision 6. Max is 999999"
 right: "InvalidArgumentError(123456789 is too large to store in a Decimal256 of precision 6. Max is 999999)"

@andygrove
Member

andygrove commented Dec 4, 2024

There is a regression in test_cast_decimal128_to_decimal256_negative. The new validation check is correctly throwing an error, but it looks like we also need to add this validation when creating decimal arrays since the current test is creating invalid arrays before the cast:

let array = vec![Some(i128::MAX), Some(i128::MIN)];
let input_decimal_array = create_decimal_array(array, 10, 3).unwrap();

I would expect this to fail, so we probably need to add the same validation there.

On second thoughts, it seems it was an intentional design decision not to validate this on array creation. Instead, an array.validate_decimal_precision method can optionally be called on the array to validate it after creation, so we should probably just update the test as needed (@alamb @tustvold perhaps you could correct me if I am wrong about this).
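
For illustration, a hedged sketch of that opt-in validation using the public Decimal128Array API rather than the test helper (the helper's exact behaviour may differ):

use arrow_array::Decimal128Array;
use arrow_schema::ArrowError;

// Creating the array does not validate the values against the precision;
// `validate_decimal_precision` is the explicit opt-in check.
fn validate_after_creation() -> Result<(), ArrowError> {
    let array = Decimal128Array::from(vec![Some(i128::MAX), Some(i128::MIN)])
        .with_precision_and_scale(10, 3)?;
    // i128::MAX needs far more than 10 digits, so the check reports an error.
    assert!(array.validate_decimal_precision(10).is_err());
    Ok(())
}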

Contributor

@tustvold tustvold left a comment

Have we run any benchmarks (I'm not sure if any actually exist) to confirm this doesn't significantly regress performance?

It seems unfortunate to always be performing overflow checks, when in many cases it should be possible to prove that precision overflow can't occur and need not be checked for.
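
For context, a hedged sketch of when the check is provably unnecessary (an illustrative helper, not code from this PR):

// When the scale does not shrink, rescaling multiplies the unscaled value by
// 10^(s_out - s_in), so the result needs at most p_in + (s_out - s_in) digits;
// if the target precision covers that, overflow cannot occur and the
// per-value validation could be skipped.
fn cast_cannot_overflow(p_in: u8, s_in: i8, p_out: u8, s_out: i8) -> bool {
    s_out >= s_in && (p_out as i32) >= (p_in as i32) + (s_out as i32) - (s_in as i32)
}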

@@ -156,19 +179,9 @@ where
T::Native: DecimalCast + ArrowNativeTypeOp,
{
let array: PrimitiveArray<T> = match input_scale.cmp(&output_scale) {
Contributor

This could be changed to <=

array.try_unary(|x| {
    f(x).ok_or_else(|| error(x)).and_then(|v| {
        O::validate_decimal_precision(v, output_precision).map(|_| v)
    })
})
Member

Why not do this when computing the value (the code above), and only when this can fail?
(for example widening precision cannot fail, so no need to spend cycles on validating the produced values)

Contributor Author

I thought about it, but there are other examples where it is done this way, so I kept it here. We could also do this as part of the first error check point, inside this method:

let error = cast_decimal_to_decimal_error::<I, O>(output_precision, output_scale);

@andygrove
Member

Have we run any benchmarks (I'm not sure if any actually exist) to confirm this doesn't significantly regress performance?

It seems unfortunate to always be performing overflow checks, when in many cases it should be possible to prove that precision overflow can't occur and need not be checked for.

I'll create a separate PR (probably tomorrow) to add some criterion benchmarks.

@himadripal
Contributor Author

himadripal commented Dec 5, 2024

There is a regression in test_cast_decimal128_to_decimal256_negative. The new validation check is correctly throwing an error, but it looks like we also need to add this validation when creating decimal arrays since the current test is creating invalid arrays before the cast:

let array = vec![Some(i128::MAX), Some(i128::MIN)];
let input_decimal_array = create_decimal_array(array, 10, 3).unwrap();

I would expect this to fail, so we probably need to add the same validation there.

On second thoughts, it seems it was an intentional design decision not to validate this on array creation. Instead, an array.validate_decimal_precision method can optionally be called on the array to validate it after creation, so we should probably just update the test as needed (@alamb @tustvold perhaps you could correct me if I am wrong about this).

Changed the test so that it passes.

// input_scale < output_scale
let array: PrimitiveArray<T> = if input_scale <= output_scale {
// input_scale <= output_scale
// the scale doesn't change, but precision may change and cause overflow
Member

@viirya viirya Dec 5, 2024

Why don't we also check precision and skip convert_to_bigger_or_equal_scale_decimal if it won't overflow?

Contributor Author

@himadripal himadripal Dec 5, 2024

Won't it still have to go through convert_to_bigger_or_equal_scale_decimal to add the zero at the end for a bigger precision conversion? IMO, we can only skip the call if the precision is smaller and the scales are equal. Please correct me here.
We could do a check at the beginning, like this:

if (array.validate_decimal_precision().is_err())

but we need to return null for safe=true and throw an error for safe=false.
Is that what you are suggesting?

Member

@viirya viirya Dec 5, 2024

For example, the original code simply returns the original array when the scale is the same. If the new precision is bigger, I think we can still do that, no?

Contributor Author

@himadripal himadripal Dec 5, 2024

Let's assume we are converting 12.345 from (5, 3) to (6, 3); then it needs to be 123450. If we do array.clone() as per the original code, I'm wondering how it will add the 0 at the end. Then it would still be 12.345.

Contributor Author

@himadripal himadripal Dec 5, 2024

If this is correct, then we can return the original array if the precision is bigger and the scale is equal.

select arrow_typeof(cast(cast(1.23 as decimal(10,3)) as decimal(12,3))),
       cast(cast(1.23 as decimal(10,3)) as decimal(12,3));
----
Decimal128(12, 3) 1.23
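
A hedged sketch of that shortcut, using Decimal128 concretely (the real kernel is generic over the decimal type, and the names here are illustrative):

use arrow_array::Decimal128Array;
use arrow_schema::ArrowError;

// If the scale is unchanged and the precision only grows, every stored value
// already fits, so the data can be reused with updated precision/scale
// metadata instead of running the conversion kernel.
fn widen_precision_same_scale(
    array: &Decimal128Array,
    input_precision: u8,
    input_scale: i8,
    output_precision: u8,
    output_scale: i8,
) -> Option<Result<Decimal128Array, ArrowError>> {
    if input_scale == output_scale && output_precision >= input_precision {
        Some(array.clone().with_precision_and_scale(output_precision, output_scale))
    } else {
        None // fall back to the general conversion path
    }
}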

Member

12345 in (5, 3) is 12.345. When casting to (6, 3), it is still 12345; why would it need to be 123450? 123450 in (6, 3) is 123.45, if I understand it correctly.
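
In other words (a small illustrative snippet, not code from the PR): the unscaled integer only changes when the scale changes, not when the precision grows.

fn main() {
    // Decimal(p, s) stores an unscaled integer v; the logical value is v * 10^(-s).
    let unscaled: i128 = 12345;      // Decimal128(5, 3) => 12.345
    let same_scale = unscaled;       // cast to (6, 3): still 12345 => 12.345
    let wider_scale = unscaled * 10; // cast to (6, 4): 123450 => 12.3450
    assert_eq!(same_scale, 12345);
    assert_eq!(wider_scale, 123450);
}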

Contributor Author

@himadripal himadripal Dec 5, 2024

Yes, you are right. The fn signature only has output_precision; I can get the input_precision from the calling fn, where it is available, and change the signature to include input_precision. Would that be fine?

Member

Yea

Contributor Author

Done. @viirya please check.

…eded.

revert whitespace changes

formatting check
@himadripal himadripal force-pushed the fix_decimal_conversion_bug branch from 68b0f68 to 6c50fe3 on December 5, 2024 at 17:32
@himadripal
Contributor Author

himadripal commented Dec 6, 2024

Can anyone please let the build run? The workflows are waiting for approval.

@andygrove
Member

Have we run any benchmarks (I'm not sure if any actually exist) to confirm this doesn't significantly regress performance?

It seems unfortunate to always be performing overflow checks, when in many cases it should be possible to prove that precision overflow can't occur and need not be checked for.

I created a simple benchmark for decimal casting in #6850.

Unsurprisingly, validating that the results are correct is slower than not validating the results.

before

cast_decimal            time:   [45.281 ns 45.549 ns 45.871 ns]

after (this PR)

cast_decimal            time:   [247.97 ns 248.78 ns 249.78 ns]
                        change: [+435.06% +439.47% +443.15%] (p = 0.00 < 0.05)

We currently have the safe config option, which can be on or off:

pub struct CastOptions<'a> {
    /// how to handle cast failures, either return NULL (safe=true) or return ERR (safe=false)
    pub safe: bool,
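
For illustration, a hedged sketch of how the two modes behave after this change (the values and target type are made up for the example):

use arrow_array::{Array, Decimal128Array};
use arrow_cast::cast::{cast_with_options, CastOptions};
use arrow_schema::{ArrowError, DataType};

fn demo() -> Result<(), ArrowError> {
    // 12345 with precision 5, scale 2 => 123.45
    let array = Decimal128Array::from(vec![Some(12345_i128)])
        .with_precision_and_scale(5, 2)?;
    // The target can only hold values up to 0.99, so 123.45 overflows.
    let target = DataType::Decimal128(2, 2);

    // safe = true: the overflowing value becomes NULL.
    let safe = CastOptions { safe: true, ..Default::default() };
    let out = cast_with_options(&array, &target, &safe)?;
    assert!(out.is_null(0));

    // safe = false: the overflowing value is reported as an error.
    let strict = CastOptions { safe: false, ..Default::default() };
    assert!(cast_with_options(&array, &target, &strict).is_err());
    Ok(())
}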

So, yes, it is a performance regression, but the previous behavior was incorrect. This PR now makes this work as advertised.

@tustvold Is there a use case we need to support for faster casts without validating results per the CastOptions?

@tustvold
Contributor

tustvold commented Dec 7, 2024

@tustvold Is there a use case we need to support for faster casts without validating results per the CastOptions?

No, the cast should be checked; apologies if that wasn't clear. My concern was that the PR as originally formulated blindly performed the checked conversion regardless of the input type, even when the cast was increasing the precision. Given that the whole purpose of tracking precision is to avoid overflow checks, it seemed a little off.

I'll try to find some time to take another look at this PR, as from a quick scan this looks to have been addressed.

Contributor

@tustvold tustvold left a comment

This is still overly pessimistic; for example, if I increase both the precision and the scale by the same amount, I shouldn't need to perform any checks.

However, the kernel is already fallible (try_unary/unary_opt vs unary), so it already won't vectorize properly, and it isn't as if we're regressing a highly optimised kernel here.

I've therefore opted to file #6877 instead; if people care, they can action it.

@tustvold tustvold merged commit eb7ab83 into apache:main Dec 12, 2024
27 checks passed
andygrove pushed a commit to andygrove/arrow-rs that referenced this pull request Jan 3, 2025
* decimal conversion looses value on lower precision, throws error now on overflow.

* fix review comments and fix formatting.

* for simple case of equal scale and bigger precision, no conversion needed.

revert whitespace changes

formatting check

---------

Co-authored-by: himadripal <[email protected]>
alamb pushed a commit that referenced this pull request Jan 5, 2025
* decimal conversion looses value on lower precision, throws error now on overflow.

* fix review comments and fix formatting.

* for simple case of equal scale and bigger precision, no conversion needed.

revert whitespace changes

formatting check

---------

Co-authored-by: Himadri Pal <[email protected]>
Co-authored-by: himadripal <[email protected]>
Labels
arrow Changes to the arrow crate

Successfully merging this pull request may close these issues.

Casting Decimal128 to Decimal128 with smaller precision produces incorrect results in some cases