vllm
[Kernel] FlashMLA integration
#13747
Merged
youkaichao
merged 26 commits into
vllm-project:main
from
LucasWilkinson:lwilkinson/flashmla-integration
mergify
added
ci/build
LucasWilkinson
force pushed
to
3ae4a6ef
298 days ago
tlrmchlsmth
commented on 2025-02-25
tlrmchlsmth
commented on 2025-02-25
tlrmchlsmth
commented on 2025-02-25
youkaichao
commented on 2025-02-25
LucasWilkinson
marked this pull request as ready for review
297 days ago
LucasWilkinson
requested a review
from
WoosukKwon
297 days ago
mgoin
commented on 2025-02-25
LucasWilkinson
added
ready
mergify
added
needs-rebase
torch library bindings, unit tests running
c5faa922
comments
a1832e90
working in eager mode
bef305b3
format
c74a4f04
cuda-graphs still broken but closer i think
1cb71c7a
better comments
205f2bc5
remove extra files
00a1f7a3
add attribution
905be3bf
fix cuda graphs
728e0b64
cleaner build fallbacks
6681e43a
ok cuda-graphs actually fixed now I think
4a755b99
format
649a7bf1
clean up
20315610
review comment
1722bb06
fix mypy
8e64f74d
review comments
b0577928
cleanup
87499c34
fix bad logic
c47e8144
review comments
cf3e5bd5
update to latest flashMLA which supports fp16
d474a4b7
update to use fork
337f3ee5
remove unnessary include
48207c9e
add fp16 source
c215c6a3
missing symbol
02a46a3d
LucasWilkinson
force pushed
from
2fa62a9d
to
02a46a3d
297 days ago
mergify
removed
needs-rebase
improve logging, skip flashmla tests when not supported
b0552982
fix pytest errors
ca7fa2d3
mgoin
approved these changes on 2025-02-26
mgoin
added
force-merge
tlrmchlsmth
approved these changes on 2025-02-26
youkaichao
merged
f9590390
into main
296 days ago
Reviewers
tlrmchlsmth
mgoin
youkaichao
simon-mo
WoosukKwon
Assignees
No one assigned
Labels
ready
ci/build
force-merge
Milestone
No milestone