vllm
1a971808 - Fix CUDA graph decode capture crash in AITER FlashAttention (#36042)

Commit
55 days ago
Fix CUDA graph decode capture crash in AITER FlashAttention (#36042) Signed-off-by: Martin Yuan <myuan@meta.com> Co-authored-by: Martin Yuan <myuan@meta.com>
Author
Parents
Loading