vllm
[Attention] Flash MLA for V1
#13867
Merged

LucasWilkinson
[Attention] MLA support for V1
998803e7
LucasWilkinson torch library bindings, unit tests running
12a5221e
LucasWilkinson comments
38076021
LucasWilkinson working in eager mode
955cead8
LucasWilkinson format
1d5c8680
LucasWilkinson cuda-graphs still broken but closer i think
eae47876
LucasWilkinson better comments
c79927d1
LucasWilkinson remove extra files
37c4f9e6
LucasWilkinson add attribution
084b031b
LucasWilkinson fix cuda graphs
07a9bad5
LucasWilkinson cleaner build fallbacks
4dc8c352
LucasWilkinson ok cuda-graphs actually fixed now I think
a6c36cc2
LucasWilkinson format
3ae4a6ef
LucasWilkinson fix deepseek-v2
a6213a48
LucasWilkinson Merge branch 'lwilkinson/fix-deepseek-v2' into lwilkinson/flashmla-in…
68895a20
LucasWilkinson clean up
5e7cd970
LucasWilkinson Merge remote-tracking branch 'origin/main' into lwilkinson/flashmla-i…
8bb3bdc9
LucasWilkinson review comment
d4399691
LucasWilkinson fix mypy
aa42226e
LucasWilkinson review comments
d18261cf
LucasWilkinson cleanup
4c08a0a8
LucasWilkinson fix bad logic
07332bfe
LucasWilkinson review comments
c4434d9e
LucasWilkinson update to latest flashMLA which supports fp16
f570fe0e
LucasWilkinson update to use fork
0bbcf279
LucasWilkinson remove unnecessary include
177ee292
LucasWilkinson add fp16 source
642456fe
LucasWilkinson missing symbol
2fa62a9d
[Attention] MLA support for V1
4b7ef4d0
LucasWilkinson Merge remote-tracking branch 'yang/mla-v1' into lwilkinson/flash-mla-v1
0ae026a5
address review feedback
23c780ff
restore to use attn_module.head_size
29c06c7b
LucasWilkinson wip v1 FlashMLA
5f8526b6
LucasWilkinson Merge remote-tracking branch 'yang/mla-v1' into lwilkinson/flash-mla-v1
f9551648
github-actions
mergify added ci/build
mergify added v1
mergify
mergify added needs-rebase
[Attention] MLA support for V1
04c8db41
address review feedback
a456e058
restore to use attn_module.head_size
867d2ede
included more fixes from Lucas
8715cfbe
addressed feedback from Woosuk Kwon
6bf7bfbc
LucasWilkinson force pushed from f9551648 to c63464de 288 days ago
mergify removed needs-rebase
LucasWilkinson Merge remote-tracking branch 'yang/mla-v1' into lwilkinson/flash-mla-v1
dab8ad6d
LucasWilkinson force pushed from c63464de to dab8ad6d 288 days ago
LucasWilkinson Merge remote-tracking branch 'origin/main' into lwilkinson/flash-mla-v1
67b2b628
LucasWilkinson cleanup
e6e57899
LucasWilkinson marked this pull request as ready for review 287 days ago
LucasWilkinson requested a review from WoosukKwon 287 days ago
LucasWilkinson requested a review from robertgshaw2-redhat 287 days ago
LucasWilkinson requested a review from njhill 287 days ago
LucasWilkinson requested a review from ywang96 287 days ago
LucasWilkinson requested a review from comaniac 287 days ago
LucasWilkinson requested a review from alexm-redhat 287 days ago
LiuXiaoxuanPKU
LucasWilkinson
mgoin added ready
mgoin
mgoin approved these changes on 2025-02-27
LucasWilkinson
mgoin enabled auto-merge (squash) 287 days ago
mgoin merged 2e94b9cf into main 287 days ago
samuellees
zuozi2810
LucasWilkinson
LucasWilkinson
samuellees

