Releases: bdashore3/flash-attention

v2.7.1.post1

06 Dec 06:09

v2.7.0.post2

03 Dec 19:23
Actions: Bump CUDA to 12.4

Signed-off-by: kingbri <[email protected]>

v2.6.3

26 Jul 00:14

Synced to the upstream version

NOTE: Backward and dropout are disabled, meaning this release is INFERENCE ONLY.

Including these features more than doubles the build time and causes the GitHub Action to time out. If you want these features, please raise an issue in the parent repo to help reduce the build times.
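
As a hedged illustration of what "inference only" means in practice, here is a minimal forward-pass sketch against the public flash_attn API. The shapes and the dropout_p/causal keywords follow the upstream docs; calling .backward() on the output would fail in this build.

```python
# Minimal inference-only sketch using the public flash_attn API.
# Assumes this wheel is installed and a CUDA GPU is available; shapes
# follow the documented (batch, seqlen, nheads, headdim) layout.
import torch
from flash_attn import flash_attn_func

batch, seqlen, nheads, headdim = 2, 1024, 8, 64
q = torch.randn(batch, seqlen, nheads, headdim, device="cuda", dtype=torch.float16)
k = torch.randn_like(q)
v = torch.randn_like(q)

with torch.inference_mode():  # no autograd graph; backward is disabled in this build
    out = flash_attn_func(q, k, v, dropout_p=0.0, causal=True)  # dropout must stay 0.0
print(out.shape)  # torch.Size([2, 1024, 8, 64])
```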

v2.6.1

12 Jul 00:06
Actions: Switch to CUDA 12.3

Signed-off-by: kingbri <[email protected]>

v2.5.9.post2

09 Jul 23:28
Pre-release

Quick release to add the softcapping commits. Does not have backward, dropout, or ALiBi support.
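
For reference, softcapping squashes attention scores with softcap * tanh(scores / softcap) before the softmax, bounding them to (-softcap, softcap). A minimal sketch follows; the `softcap` keyword matches upstream flash-attn, and its exact spelling in this particular wheel is an assumption.

```python
# Sketch of calling the kernel with softcapping enabled. The keyword
# name `softcap` follows upstream flash-attn and is assumed here.
import torch
from flash_attn import flash_attn_func

q = torch.randn(1, 512, 8, 64, device="cuda", dtype=torch.float16)
k, v = torch.randn_like(q), torch.randn_like(q)

# Scores are transformed as softcap * tanh(scores / softcap) inside the kernel;
# softcap=0.0 disables capping.
out = flash_attn_func(q, k, v, causal=True, softcap=30.0)
```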

v2.5.9.post1

28 May 01:45
Actions: Clarify dispatch formatting

Signed-off-by: kingbri <[email protected]>

v2.5.8

28 Apr 07:55

Same as the upstream tag

Now built only for torch 2.2.2 and 2.3.0
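
Since the wheels target specific torch builds, a quick hedged sanity check of the local environment before downloading one:

```python
# Pre-install sanity check: confirm the local torch and CUDA versions
# match what a given wheel in this release was built against.
import torch

print(torch.__version__)          # expect 2.2.2 or 2.3.0 for this release
print(torch.version.cuda)         # CUDA version torch was built against
print(torch.cuda.is_available())  # these wheels require a CUDA GPU
```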

v2.5.6

30 Mar 20:34

v2.5.2

07 Feb 22:37

Same as the upstream tag

Adds this PR to help fix building on Windows

v2.4.2

03 Feb 00:29

In line with the parent repo's tag

Made for CUDA 12.x and PyTorch 2.1.2 and 2.2

v2.4.3 and up cannot be built on Windows at this time.