vllm
1a971808
- Fix CUDA graph decode capture crash in AITER FlashAttention (#36042)
Go
Login via GitHub
Home
Pricing
FAQ
Install
Login
via GitHub
Commit
View On
GitHub
Commit
55 days ago
Fix CUDA graph decode capture crash in AITER FlashAttention (#36042) Signed-off-by: Martin Yuan <myuan@meta.com> Co-authored-by: Martin Yuan <myuan@meta.com>
References
#36042 - Fix CUDA graph decode capture crash in AITER FlashAttention
Author
iseeyuan
Parents
7eb524e6
Loading