onnxruntime
Flash attention recompute
#20603
Merged

Flash attention recompute #20603

pengwa merged 20 commits into main from pengwa/flash_attn_recompute
pengwa
pengwa flash attn recompute
45879ff5
pengwa pengwa added training
pengwa pengwa requested a review from wschin wschin 2 years ago
pengwa pengwa requested a review from zhijxu-MS zhijxu-MS 2 years ago
pengwa use json file to pass recompute plans
3c374da6
pengwa fix
f822e7b1
pengwa pengwa changed the title Flash attn recompute Flash attention recompute 2 years ago
pengwa fixes
4ee17c43
pengwa minor
11a15a0a
pengwa fix
36763054
pengwa fix build
ac44c6ce
pengwa fix win build
6b8120a2
pengwa fix win
53129178
pengwa fixes
20baf152
pengwa fix tests
c5cc3196
pengwa Merge branch 'main' of https://github.com/microsoft/onnxruntime into …
51110537
guyang3532
guyang3532 commented on 2024-05-10
guyang3532
guyang3532 commented on 2024-05-10
pengwa refinement
624adcd0
pengwa Merge branch 'main' of https://github.com/microsoft/onnxruntime into …
757ed236
guyang3532
guyang3532 commented on 2024-05-11
guyang3532
guyang3532 commented on 2024-05-11
pengwa fixes
f6ace9b8
guyang3532
guyang3532 dismissed these changes on 2024-05-13
zhijxu-MS
zhijxu-MS commented on 2024-05-13
zhijxu-MS
zhijxu-MS commented on 2024-05-13
pengwa restore the rng stage for CPU and CUDA
94d510f7
pengwa pengwa dismissed their stale review via 94d510f7 1 year ago
pengwa Merge branch 'main' of https://github.com/microsoft/onnxruntime into …
56426960
pengwa remove c++ test because it is hard to maintain it
9b94d4cc
pengwa minor
0e9d80ff
wschin
wschin approved these changes on 2024-05-21
pengwa Merge branch 'main' of https://github.com/microsoft/onnxruntime into …
31e8b97f
pengwa pengwa merged 8a98874e into main 1 year ago
pengwa pengwa deleted the pengwa/flash_attn_recompute branch 1 year ago

Login to write a write a comment.

Login via GitHub

Assignees
No one assigned
Labels
Milestone