
"Flash attention" error in a GPU enabled laptop with CUDA 12.1, Windows 11, Pytorch 2.4.0 #219

Closed
sleeplessTLV opened this issue Aug 14, 2024 · 4 comments

Comments

@sleeplessTLV

Hi,
I installed and downloaded everything; Python 3.12 and PyTorch 2.4.0+cu121 (CUDA 12.1) are installed.
While running the basic Jupyter notebook, at the prediction step I get:
"c:\Python312\segment-anything-2\sam2\modeling\backbones\hieradet.py:68: UserWarning: 1Torch was not compiled with flash attention. (Triggered internally at C:\actions-runner_work\pytorch\pytorch\builder\windows\pytorch\aten\src\ATen\native\transformers\cuda\sdp_utils.cpp:555.)
x = F.scaled_dot_product_attention("
Earlier, at the
"from sam2.sam2_image_predictor import SAM2ImagePredictor"
step, I only got a warning:
"c:\Python312\segment-anything-2\sam2\modeling\sam\transformer.py:23: UserWarning: Flash Attention is disabled as it requires a GPU with Ampere (8.0) CUDA capability.
OLD_GPU, USE_FLASH_ATTN, MATH_KERNEL_ON = get_sdpa_settings()"
So can I bypass this with a setting that tells it NOT to use Flash Attention?
thanks
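
A minimal sketch of one way to keep PyTorch from ever attempting the Flash Attention backend of scaled_dot_product_attention, assuming PyTorch 2.3 or newer on a CUDA device (the tensor shapes below are purely illustrative and not taken from SAM 2):

```python
import torch
import torch.nn.functional as F
from torch.nn.attention import SDPBackend, sdpa_kernel

# Purely illustrative shapes: (batch, heads, sequence length, head dim).
q = k = v = torch.randn(1, 8, 128, 64, device="cuda", dtype=torch.float16)

# Limit scaled_dot_product_attention to the memory-efficient and math
# kernels, so the Flash Attention backend is never attempted.
with sdpa_kernel([SDPBackend.EFFICIENT_ATTENTION, SDPBackend.MATH]):
    out = F.scaled_dot_product_attention(q, k, v)

# A global alternative is to turn the flash backend off entirely:
# torch.backends.cuda.enable_flash_sdp(False)
```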

@sleeplessTLV
Author

my bad, I don't know how to delete this issue, but apparently my mistake

@Dashenboy

how did you fix this issue?

@ronghanghu
Contributor

@Dashenboy This is mainly a warning indicating that the GPU does not support Flash Attention, so PyTorch will fall back to other scaled dot-product attention kernels. It doesn't need fixing, and you can still use SAM 2 in this case.

More details: Flash Attention is generally faster but is only fully supported on GPUs with CUDA compute capability >= 8.0. If your GPU has a lower compute capability (which can be checked at https://developer.nvidia.com/cuda-gpus), this warning will be printed to indicate that Flash Attention is not available for you (but you can safely ignore it).
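
A minimal sketch of checking the compute capability directly from Python, using standard PyTorch calls (device index 0 is assumed):

```python
import torch

# Flash Attention in PyTorch's SDPA path needs compute capability >= 8.0 (Ampere or newer).
major, minor = torch.cuda.get_device_capability(0)
print(f"GPU: {torch.cuda.get_device_name(0)}, compute capability {major}.{minor}")
print("Flash Attention eligible:", (major, minor) >= (8, 0))
```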

@skylning

My GPU is an RTX 4090 with CUDA compute capability 8.9, and I also get this error.

4 participants