You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
This PR addresses #2204 and #2205.
* fixes illegal access / test coverage for mean row-wise kernel
* fixes illegal access / test coverage for stdev row-wise kernel
* modified sum kernels to utilize Kahan/Neumaier summation per thread, also increase load per thread to benefit from this
FYI, @tfeher
Authors:
- Malte Förster (https://github.com/mfoerste4)
Approvers:
- Tamas Bela Feher (https://github.com/tfeher)
URL: #2223
Describe the bug
The sum kernel does not handle underflows correctly, and that leads to inaccurate results.
Steps/Code to reproduce bug
As reported by @lijinf2:
Expected behavior
Compensate for underflow e.g. by Kahan summation or a similar method.
The text was updated successfully, but these errors were encountered: