This repository has been archived by the owner on Mar 21, 2024. It is now read-only.
Need a macro to disable bf16 support? #478
Labels: nvbug (has an associated internal NVIDIA NVBug); P1: should have (necessary, but not critical); type: enhancement (new feature or request)
After #306, CUB supports bf16 in some functions. Unfortunately, this means the bf16 headers (`cuda_bf16.h`, `cuda_bf16.hpp`) become a dependency of Thrust/CUB. Let me explain why that is unfortunate.

Normally this would be fine, since in the majority of cases the CUDA headers are needed at compile time anyway. In some setups, however, it is a big issue. One example is a CUDA runtime Docker image (which contains no headers) where Jitify is used to compile CUB kernels at runtime: the bf16 headers are simply missing.
Usually, the community approach is to bundle the required headers and redistribute them. For example, Thrust, CUB, and the fp16 headers (`cuda_fp16.h`, `cuda_fp16.hpp`) can be (and are being) repackaged in many projects. The problem is that the bf16 headers are not redistributable under the CUDA EULA and, based on an internal conversation, this will remain the case for a long while. Therefore, it would be nice if CUB could offer a macro to disable the bf16 support.
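To make the request concrete, here is a minimal sketch of how such an opt-out could be structured. The macro name `EXAMPLE_DISABLE_BF16_SUPPORT`, the helper `bf16_enabled()`, and the `example_traits` template are all hypothetical names chosen for illustration; they are not existing CUB identifiers, and the real guard would have to wrap every place where CUB touches `__nv_bfloat16`.

```cpp
// Sketch only: all EXAMPLE_* / example_* names are hypothetical, not CUB API.
// A user would define the opt-out macro before including the library, e.g.
// with -DEXAMPLE_DISABLE_BF16_SUPPORT or at the top of a runtime-compiled source.
#define EXAMPLE_DISABLE_BF16_SUPPORT

#if !defined(EXAMPLE_DISABLE_BF16_SUPPORT)
#  include <cuda_bf16.h>  // only pulled in when bf16 support is enabled
#  define EXAMPLE_HAS_BF16 1
#else
#  define EXAMPLE_HAS_BF16 0
#endif

// Generic trait: no type is treated as bf16 by default.
template <typename T>
struct example_traits
{
  static constexpr bool is_bf16 = false;
};

#if EXAMPLE_HAS_BF16
// The bf16 specialization exists only when the header is available.
template <>
struct example_traits<__nv_bfloat16>
{
  static constexpr bool is_bf16 = true;
};
#endif

// Lets calling code check at compile time whether bf16 paths were built.
constexpr bool bf16_enabled() { return EXAMPLE_HAS_BF16 != 0; }
```

With the macro defined, the translation unit never references `cuda_bf16.h`, so it compiles in an environment that ships Thrust, CUB, and the fp16 headers but not the bf16 ones.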