`try_to_fold_vector_reduce<Call>` has incorrect behavior #6883

rootjalex · 2022-07-25T16:24:30Z

In CodeGen_LLVM, we call try_to_fold_vector_reduce<Call> on saturating_add or saturating_sub calls, while not providing information as to whether or not the accumulation is an addition or a subtraction:

https://github.com/halide/Halide/blob/11a049c3967a277173e288ffd802f08ce1a1b78e/src/CodeGen_LLVM.cpp#L2835-#L2839

This seems like incorrect behavior - I noticed this while restructuring CodeGen_X86 into separate optimization and code generation passes, because it appears that the accumulating saturating dot product instructions should trigger on both of these patterns:

saturating_sub(wild_i32x, VectorReduce(SaturatingAdd, factor=4, widening_mul(wild_i16x, wild_i16x)))

saturating_add(wild_i32x, VectorReduce(SaturatingAdd, factor=4, widening_mul(wild_i16x, wild_i16x)))

The text was updated successfully, but these errors were encountered:

rootjalex added the bug label Jul 25, 2022

rootjalex mentioned this issue Jul 29, 2022

Don't try to fold saturating_sub of VectorReduce #6896

Merged

steven-johnson closed this as completed in #6896 Aug 1, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

`try_to_fold_vector_reduce<Call>` has incorrect behavior #6883

`try_to_fold_vector_reduce<Call>` has incorrect behavior #6883

rootjalex commented Jul 25, 2022

try_to_fold_vector_reduce<Call> has incorrect behavior #6883

try_to_fold_vector_reduce<Call> has incorrect behavior #6883

Comments

rootjalex commented Jul 25, 2022

`try_to_fold_vector_reduce<Call>` has incorrect behavior #6883

`try_to_fold_vector_reduce<Call>` has incorrect behavior #6883