Need a macro to disable bf16 support? #478

leofang · 2022-05-11T04:36:47Z

After #306 CUB supports bf16 in some functions. Unfortunately, this means the bf16 headers (cuda_bf16.h, cuda_bf16.hpp) become a dependency of Thrust/CUB. Let me explain why it is unfortunate.

Normally, this would be fine in the majority of cases, as CUDA headers are also needed at compile time. But in some cases, such as within a CUDA runtime docker (which contains no headers) + using Jitify to compile CUB kernels at runtime, it is a big issue due to the missing headers.

Usually, the community approach is to bundle certain headers and redistribute them, so for example Thrust, CUB, and the fp16 headers (cuda_fp16.h, cuda_fp16.hpp) can be (and are being) repackaged in many projects. The unfortunate point is that the bf16 headers are not redistributable according to the CUDA EULA, and based on an internal conversation it'd remain the case for a long while. Therefore, it would be nice if CUB could offer a macro to disable the bf16 support.

The text was updated successfully, but these errors were encountered:

gevtushenko · 2022-05-11T15:42:04Z

@leofang thank you for reporting the issue! Please, check if the following PR addresses your issue.

leofang · 2022-05-11T15:43:42Z

Relevant: nvbugs 3641496

leofang mentioned this issue May 11, 2022

Adding support for segmented sorting cupy/cupy#6699

Open

gevtushenko self-assigned this May 11, 2022

gevtushenko mentioned this issue May 11, 2022

Add option to disable BF16 support #480

Merged

alliepiper added type: enhancement New feature or request. nvbug Has an associated internal NVIDIA NVBug. P1: should have Necessary, but not critical. labels May 11, 2022

alliepiper added this to the 2.0.0 milestone May 11, 2022

alliepiper linked a pull request May 11, 2022 that will close this issue

Add option to disable BF16 support #480

Merged

gevtushenko closed this as completed in #480 Jun 4, 2022

gevtushenko mentioned this issue Feb 20, 2023

Cleanup CTK version checks #630

Merged

gevtushenko mentioned this issue Nov 8, 2023

Fix reduce to match the documentation and use numeric limits NVIDIA/cccl#920

Closed

peizhang-cn mentioned this issue Jun 13, 2023

Compile error: cannot find cuda_bf16.h NVIDIA/cccl#930

Open

leofang mentioned this issue Sep 10, 2023

Make CuPy run without headers under CUDA 12.2 for Windows cupy/cupy#7776

Closed

leofang mentioned this issue Nov 22, 2023

cuda::std::complex specializations for half and bfloat NVIDIA/cccl#1140

Merged

3 tasks

leofang mentioned this issue Feb 2, 2024

Error when including mma.h: instance of overloaded function "__half::__half" matches the specified type cupy/cupy#8146

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Need a macro to disable bf16 support? #478

Need a macro to disable bf16 support? #478

leofang commented May 11, 2022

gevtushenko commented May 11, 2022

leofang commented May 11, 2022

Need a macro to disable bf16 support? #478

Need a macro to disable bf16 support? #478

Comments

leofang commented May 11, 2022

gevtushenko commented May 11, 2022

leofang commented May 11, 2022