Fix DeviceHistogram::Even for mixed float/int levels and sample types. #487

alliepiper · 2022-05-19T19:38:32Z

The ScaleTransform utility precomputes the reciprocal of the "sample -> bin_idx" scaling factor for floating point types as an optimization.

Trouble is, the Init method checked whether LevelT is fp while BinSelect checked SampleT. This caused the optimization to be incorrectly applied when one of these types is fp but the other is not.

Fixed this bug and simplified the implementation of ScaleTransform using cuda::std::common_type to consistently apply the optimization when either LevelT or SampleT is fp.

Fixes #479 and #489 and adds regression tests for both.
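
To illustrate the approach described above, here is a minimal host-only sketch, not the actual CUB code. It assumes `std::common_type` as a stand-in for `cuda::std::common_type`. The struct `EvenBinSketch`, its members, and `BinOf` are hypothetical names, and range checks on the sample are assumed to happen elsewhere. A single common type decides both how the scale is precomputed and how a sample is binned, so the two steps cannot disagree.

```cpp
#include <type_traits>

// Hypothetical, simplified stand-in for a "sample -> bin index" transform.
// Assumption: std::common_type plays the role of cuda::std::common_type.
template <typename LevelT, typename SampleT>
struct EvenBinSketch
{
  // One type drives storage, arithmetic, and the choice of code path.
  using CommonT = typename std::common_type<LevelT, SampleT>::type;
  static constexpr bool is_fp = std::is_floating_point<CommonT>::value;

  int     num_bins;
  CommonT min_level;
  CommonT range;      // upper - lower
  CommonT reciprocal; // num_bins / range; only used on the floating point path

  constexpr EvenBinSketch(int bins, LevelT lower, LevelT upper)
      : num_bins(bins)
      , min_level(static_cast<CommonT>(lower))
      , range(static_cast<CommonT>(upper) - static_cast<CommonT>(lower))
      , reciprocal(is_fp ? static_cast<CommonT>(bins) / range : CommonT{})
  {}

  // Assumes the caller has already checked lower <= sample < upper.
  constexpr int BinOf(SampleT sample) const
  {
    const CommonT offset = static_cast<CommonT>(sample) - min_level;
    if (is_fp)
    {
      // Floating point common type: multiply by the precomputed reciprocal.
      return static_cast<int>(offset * reciprocal);
    }
    // Integral common type: multiply before dividing so that fractional
    // bin widths (for example 6 / 4 = 1.5) are not truncated up front.
    return static_cast<int>(offset * static_cast<CommonT>(num_bins) / range);
  }
};
```

Under these assumptions, `EvenBinSketch<int, float>(4, 0, 8).BinOf(3.5f)` yields bin 1, and the all-integer case `EvenBinSketch<int, int>(4, 0, 6).BinOf(4)` yields bin 2 even though the bin width (1.5) is fractional.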

Breaking Changes:

#487: Fixed the DeviceHistogram::HistogramEven algorithms when the samples and levels are different types (#479, "Incorrect result when LevelT does not exactly match SampleT") and when both are integral with fractional bin sizes (#489, "Incorrect result with integer levels and samples"). These fixes introduced the following explicit restrictions:
- Both SampleT and LevelT must support common arithmetic operations.
- cuda::std::common_type<SampleT, LevelT> must have a valid definition.
- The common type must be convertible to int.
- The common type must be trivially copyable.
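
The last three restrictions can be expressed as a compile-time check. The following is only a sketch under stated assumptions: it uses `std::common_type` in place of `cuda::std::common_type`, the trait name `SatisfiesEvenRequirements` and the helper `sketch_void_t` are hypothetical, and the first restriction (support for common arithmetic operations) is not checked.

```cpp
#include <type_traits>

// C++14-compatible void_t helper (hypothetical name to avoid clashes).
template <typename...>
struct sketch_make_void { using type = void; };
template <typename... Ts>
using sketch_void_t = typename sketch_make_void<Ts...>::type;

// Primary template: chosen when no common type exists.
template <typename SampleT, typename LevelT, typename = void>
struct SatisfiesEvenRequirements : std::false_type
{};

// Chosen when std::common_type<SampleT, LevelT>::type is valid; it then
// checks convertibility to int and trivial copyability of that type.
template <typename SampleT, typename LevelT>
struct SatisfiesEvenRequirements<
  SampleT,
  LevelT,
  sketch_void_t<typename std::common_type<SampleT, LevelT>::type>>
    : std::integral_constant<
        bool,
        std::is_convertible<typename std::common_type<SampleT, LevelT>::type, int>::value &&
        std::is_trivially_copyable<typename std::common_type<SampleT, LevelT>::type>::value>
{};
```

A dispatch layer could then reject unsupported combinations early with something like `static_assert(SatisfiesEvenRequirements<SampleT, LevelT>::value, "...")`, which gives a readable error instead of a failure deep inside the kernel.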

alliepiper · 2022-05-19T19:40:22Z

gpuCI: NVIDIA/thrust#1697

alliepiper · 2022-05-19T20:13:18Z

@Melirius This should fix your issue with DeviceHistogram. Let us know if you get a chance to test it out.

Melirius · 2022-05-21T20:17:10Z

It is nice, thanks for your work! However, it does not address another problem #489 that is rooted in the same code.

gevtushenko

Thank you for starting the work on this! It seems that no one looked at the code for a while 😄
Regarding the PR, it seems to introduce some substantial changes. I'd like us to consider all implications before merging this PR. For instance, the documentation doesn't say much about SampleT and LevelT. Unlike CounterT, LevelT isn't even required to be primitive. Let's gather requirements on these types, update the documentation / assert these requirements, and then have another review.

cub/device/dispatch/dispatch_histogram.cuh

gevtushenko

Thanks for documenting actual requirements! A few minor fixes below.

cub/device/dispatch/dispatch_histogram.cuh

miscco · 2022-08-08T18:35:47Z

cub/detail/cpp_compatibility.cuh

+#include <cub/util_cpp_dialect.cuh>
+
+#if CUB_CPP_DIALECT >= 2017 && __cpp_if_constexpr
+#  define CUB_IF_CONSTEXPR if constexpr


Just to be sure, we cannot add -Wno-c++17-extensions to our build flags?

Asking for a friend

alliepiper · 2022-08-09

No -- Thrust/CUB are header only, so that would require our users to do the same.
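
For context on the exchange above, here is a hedged sketch of how a dialect-gated `if constexpr` macro can be used. The macro name `SKETCH_IF_CONSTEXPR`, the plain `if` fallback, and the function `HalfAsInt` are assumptions for illustration and are not taken from the diff. The sketch uses `__cplusplus` where the real header uses `CUB_CPP_DIALECT`.

```cpp
#include <type_traits>

// Use `if constexpr` only when the dialect supports it; otherwise fall back
// to a plain `if` so that pre-C++17 builds see no C++17-extension warnings.
#if defined(__cpp_if_constexpr) && __cplusplus >= 201703L
#  define SKETCH_IF_CONSTEXPR if constexpr
#else
#  define SKETCH_IF_CONSTEXPR if
#endif

// Hypothetical example function. Because of the plain `if` fallback, both
// branches must be valid code for every T this is instantiated with.
template <typename T>
constexpr int HalfAsInt(T value)
{
  SKETCH_IF_CONSTEXPR (std::is_floating_point<T>::value)
  {
    return static_cast<int>(value * T(0.5));
  }
  else
  {
    return static_cast<int>(value / T(2));
  }
}
```

The trade-off is that code behind such a macro cannot rely on the discarded branch being left uninstantiated, since older dialects compile both branches.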

alliepiper added the "type: bug: functional" (Does not work as intended.) and "P1: should have" (Necessary, but not critical.) labels May 19, 2022

alliepiper added this to the 2.0.0 milestone May 19, 2022

alliepiper requested a review from gevtushenko May 19, 2022 19:38

alliepiper added a commit to alliepiper/thrust that referenced this pull request May 19, 2022

Testing NVIDIA/cub#487.

9c28b7d

alliepiper added the "testing: gpuCI in progress" (Started gpuCI testing.) label May 19, 2022

alliepiper added the "testing: gpuCI passed" (Passed gpuCI testing.) label and removed the "testing: gpuCI in progress" (Started gpuCI testing.) label May 20, 2022

Melirius mentioned this pull request May 21, 2022

cub::DeviceHistogram::HistogramEven: Incorrect result with integer levels and samples #489

Closed

gevtushenko suggested changes May 24, 2022

Add CUB_IF_CONSTEXPR abstractions.

80f5878

alliepiper added the "release: breaking change" (Include in "Breaking Changes" section of release notes.) label Aug 5, 2022

alliepiper linked an issue Aug 5, 2022 that may be closed by this pull request

cub::DeviceHistogram::HistogramEven: Incorrect result with integer levels and samples #489

Closed

alliepiper force-pushed the histogram_mixed_type/gh.479 branch from b0b8adb to ff1b254 Compare August 5, 2022 21:04

alliepiper added a commit to alliepiper/thrust that referenced this pull request Aug 5, 2022

Testing NVIDIA/cub#487.

0c934d7

alliepiper added a commit to alliepiper/thrust that referenced this pull request Aug 5, 2022

Testing NVIDIA/cub#487.

ce9a79a

alliepiper added the "testing: gpuCI in progress" (Started gpuCI testing.) label and removed the "testing: gpuCI passed" (Passed gpuCI testing.) label Aug 5, 2022

alliepiper requested a review from gevtushenko August 5, 2022 21:06

gevtushenko approved these changes Aug 5, 2022

alliepiper force-pushed the histogram_mixed_type/gh.479 branch from ff1b254 to c765c89 Compare August 8, 2022 18:13

alliepiper added a commit to alliepiper/thrust that referenced this pull request Aug 8, 2022

Testing NVIDIA/cub#487.

0f88d64

miscco reviewed Aug 8, 2022

alliepiper force-pushed the histogram_mixed_type/gh.479 branch from c765c89 to 16288ce Compare August 8, 2022 20:42

alliepiper added a commit to alliepiper/thrust that referenced this pull request Aug 8, 2022

Testing NVIDIA/cub#487.

36406a8

alliepiper force-pushed the histogram_mixed_type/gh.479 branch from 16288ce to cf20c71 Compare August 9, 2022 18:53

alliepiper added a commit to alliepiper/thrust that referenced this pull request Aug 9, 2022

Testing NVIDIA/cub#487.

c7aaf7d

alliepiper force-pushed the histogram_mixed_type/gh.479 branch from cf20c71 to 24b675d Compare August 9, 2022 21:22

alliepiper added a commit to alliepiper/thrust that referenced this pull request Aug 9, 2022

Testing NVIDIA/cub#487.

af2f527

Fix DeviceHistogram::Even for mixed float/int levels and sample types.

3b3bf92

Fixes NVIDIA#479 and NVIDIA#489.

alliepiper force-pushed the histogram_mixed_type/gh.479 branch from 24b675d to 3b3bf92 Compare August 9, 2022 23:45

alliepiper added a commit to alliepiper/thrust that referenced this pull request Aug 9, 2022

Testing NVIDIA/cub#487.

f5e8c3c

alliepiper merged commit 5e990ba into NVIDIA:main Aug 10, 2022

alliepiper deleted the histogram_mixed_type/gh.479 branch August 10, 2022 20:24
