[Distributed]Exposure softmax_lse & seed_offset in FlashAttention #56066
Merged: ForFishes merged 1 commit into PaddlePaddle:incubate/new_frl from ForFishes:fix_flash_attn on Aug 9, 2023
Conversation
Your PR has been submitted successfully. Thank you for contributing to this open-source project!
The branch was force-pushed from d875732 to 2a07ff5, and again from 2a07ff5 to 9f9e3d2.
sneaxiy approved these changes on Aug 9, 2023.
hitywt pushed commits to hitywt/Paddle that referenced this pull request on Oct 17, Oct 23, Oct 26, Nov 7, Nov 8, Nov 9, Nov 14, Nov 25, Nov 28 (×3), Nov 29, Dec 4 (×5), and Dec 5 (×8), 2023.
zhiqiu pushed a commit that referenced this pull request on Dec 6, 2023:
* part-3 cherry from: add check for cembedding (#55621)
* part-3 fix cherry from: add check for cembedding
* part-3 fix c_embedding
* fix test_gpt_with_pir caused by pir
* part-3 cherry from: [Distributed] Support dp/sharding overlap in virtual pp (#55651)
  * Add virtual pp and dp overlap
  * add sharding/dp overlap
  * add dp/vpp overlap
  * fix code
  * fix log
* part-3 cherry from: [cherry-pick] Integration flash attention 2 (#56015)
  * [FlashAttn] add flash randomness control (#52902)
    * add flash randomness control
    * fix VLOG undefied
  * [WIP] Integration flash attention 2 (#55758)
    * Work for fa-2 padded fwd. Code to be cleaned.
    * Work for fa2 unpadded fwd.
    * Work for padded-bwd, dk get small diff on np.random.seed(0)
    * Anyway I pass paddle's utest, except return softmax without dropout.
    * Clean code.
    * Modify interface.
    * Clean code and add some check.
    * Easy compile for dev.
    * Fix ci.
    * Fix ci-build.
    * Add std c++17 option again.
    * Limit max job when compiling fa2.
    * Remove const_cast
    * Add fwd params, to be cleaned.
    * Clean code.
    * Add bwd params.
    * Clean code.
    * Add enforce.
    * Use v2.0.4
    * Pass RNG state to fa2 capi
    * Fix review.
    * Add assert
    * Skip compile for sm less than 80.
  ---------
  Co-authored-by: Chitsing KUI <[email protected]>
* part-4 cherry from: fix codestyle (#56066)
* part-4 cherry from(no change): Add assert for static and other plateform (#56044)
* part-4 cherry-pick from: dp and sharding coexist (#56096)
  * dp and sharding coexist
  * dp
* part-4 cherry from: [Distributed] Add debug information for processgroupnccl (#56441)
  * add debug information
  * fix log
  * fix log
  * add detach for pp
* part-4 cherry from: [BugFix]Fix bug in paddle.device.cdua.synchronize() (#56451)
  * fix bug in synchronize
  * fix bug in synchronize
* part-4 cherry from: add fused gradient (#57048)
* part-4 cherry from: [Distribtued] add eager_communication_connection for eager mode in nccl (#57517)
  * add eager_nccl_connection
  * add eager_connection
  * add eager_connection
* part-4 cherry from: Add auto growth allocator for CUDA pinned allocator (#57625)
  * fix h2d bandwidth
  * remove useless flags
* fix cherrry pick #56066
* part-5 cherry from: Add allocation debug FLAGS (#57797)
  * Add allocation debug FLAGS
  * add sync after value set
  * refine flags
* part-5 cherry from: fix softmax backward (#57971)
* part-5 cherry from: [Distributed]Optimize memory in processgroup (#58299)
  * optimize memory in processgroupnccl
* part-5 cherry from: [Distributed]Add unbalance batch for virtual pp (#58383)
  * add unbalanced batch for vpp
  * fix
  * fix comments
* fix kunlun compatibility issues
* fix test_fused_rotary_position_embedding.py
* fix allocator.h
* tinyfix
* fix conflicts
* fix new ir translator c_embedding failure
---------
Co-authored-by: ShenLiang <[email protected]>
Co-authored-by: umiswing <[email protected]>
Co-authored-by: Chitsing KUI <[email protected]>
Co-authored-by: niuliling123 <[email protected]>
Co-authored-by: liuzhenhai93 <[email protected]>
Co-authored-by: sneaxiy <[email protected]>
PR types: New features
PR changes: Others
Description: Expose softmax_lse & seed_offset in FlashAttention.
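The description does not show what the newly exposed outputs contain, so below is a small, self-contained sketch of what softmax_lse means. This is not the Paddle FlashAttention kernel or its real signature; `naive_attention_with_lse` is a hypothetical reference function that computes attention the slow way and returns the per-query-row log-sum-exp that the fused kernel would expose. (The other exposed output, seed_offset, carries the dropout RNG state so a recomputation pass can reproduce the same dropout mask; it is kernel-internal and not illustrated here.)

```python
# Minimal reference sketch (an assumption for illustration, not the Paddle
# kernel or its API): softmax_lse is the per-query-row log-sum-exp of the
# scaled attention logits. Exposing it lets callers re-normalize or merge
# partial attention results and recompute backward without keeping the full
# score matrix.
import paddle


def naive_attention_with_lse(q, k, v):
    # q, k, v: [batch, num_heads, seq_len, head_dim]
    scale = q.shape[-1] ** -0.5
    scores = paddle.matmul(q, k, transpose_y=True) * scale  # [b, h, sq, sk]
    lse = paddle.logsumexp(scores, axis=-1)                  # softmax_lse: [b, h, sq]
    probs = paddle.exp(scores - lse.unsqueeze(-1))           # softmax computed via lse
    out = paddle.matmul(probs, v)                            # [b, h, sq, head_dim]
    return out, lse


q = paddle.randn([1, 2, 8, 16])
k = paddle.randn([1, 2, 8, 16])
v = paddle.randn([1, 2, 8, 16])
out, softmax_lse = naive_attention_with_lse(q, k, v)
print(out.shape, softmax_lse.shape)  # [1, 2, 8, 16] [1, 2, 8]
```

Keeping softmax_lse around is what makes it possible to combine attention outputs computed over different key/value shards (as in sequence-parallel or ring-attention style schemes), which is why distributed code wants it returned from the op rather than discarded inside the kernel.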