vllm
[FEAT][ROCm]: Support AITER MLA on V1 Engine
#17523
Merged
DarkLight1337 merged 48 commits into vllm-project:main from EmbeddedLLM:aiter-mla-v1
add AITER MLA implementation in attention backend
f782c664
remove unused arguments in aiter mla decode fwd kernel
42d5c620
add unittest for AITER MLA backend in attention selector
565a3fda
add unittest for MLA attention backend selector
645f4009
code cleaning
22c87260
update AITER version
5dc13489
Merge remote-tracking branch 'origin/main' into aiter-mla-integration
12f80237
add ck flash attn in prefill mla computation
da8c69f9
further code cleaning
1ea5718e
Merge remote-tracking branch 'origin/main' into aiter-mla-integration
681d7772
fix mypy typing errors
9ada055a
Merge remote-tracking branch 'origin/main' into aiter-mla-integration
1ceb3b97
fix mypy error on Iterable typing error
20a3f074
remove padding for v tensor in AITER MLA which improves performance
194a42a1
upgrade aiter package version
a9a02d59
only support AITER FA in AITER MLA backend to avoid latency caused by…
02a4fb32
Merge remote-tracking branch 'origin/main' into aiter-mla-integration
95213e29
add missing data types of arguments in aiter_mla_decode_fwd
6e484337
support AITER MLA backend on V1
0265f201
uncomment the required packages in common.txt
693c8709
bugfix in building decode metadata for AITER MLA decode forward pass
a5a1a54e
optimize the AITER decode metadata build
38c67c76
Merge remote-tracking branch 'origin/main' into aiter-mla-v1
74c9cb3a
bugfix caused by merging with main
643d07f4
Handle v1 AITER MLA backend in rocm platform
6171e508
update AITER MLA decode metadata build
905cec93
update AITER commit
20e769ee
Merge remote-tracking branch 'origin/main' into aiter-mla-v1
455bbf2b
update proper logging info in selected backend as well as updating at…
90daf6ed
vllmellm requested a review from tlrmchlsmth 1 year ago
vllmellm requested a review from WoosukKwon 1 year ago
vllmellm requested a review from robertgshaw2-redhat 1 year ago
vllmellm requested a review from njhill 1 year ago
vllmellm requested a review from ywang96 1 year ago
vllmellm requested a review from comaniac 1 year ago
vllmellm requested a review from alexm-redhat 1 year ago
mergify added ci/build
mergify added v1
vllmellm marked this pull request as draft 1 year ago
fix wrong sync merge to main
f68e9265
fix pre-commit
821f475c
Merge remote-tracking branch 'origin/main' into aiter-mla-v1
1a6ba998
add the missing line in common.py
7cc28d28
vllmellm marked this pull request as ready for review 1 year ago
vllmellm marked this pull request as draft 1 year ago
vllmellm marked this pull request as ready for review 1 year ago
hongxiayang added rocm
fix wrong logger info message
29fc0600
houseroad approved these changes on 2025-05-05
houseroad added ready
houseroad enabled auto-merge (squash) 1 year ago
clean code and fix AITER block scaled kernel fake impl in v1 engine
7f1ed779
disabled auto-merge 1 year ago (Head branch was pushed to by a user without write access)
use env variable to adjust timeout for model execution
11a89852
remove unnecessary backend check
825f387c
make model execution timeout in envs variable rocm specific variable …
cbeb0df5
houseroad commented on 2025-05-05
hongxiayang approved these changes on 2025-05-02
hongxiayang commented on 2025-05-05
fix unit-test
2218bbcf
Merge remote-tracking branch 'origin/main' into aiter-mla-v1
cb985044
bugfix to update AITER MLA V1 decode forward after sync with main
44d813f2
Update vllm/platforms/rocm.py
423c0bef
address PR comments
58d79bd5
SageMoore requested changes on 2025-05-06
update assertion message
95644ea8
mergify added needs-rebase
remove env variable for model execution timeout
f41d616c
mergify removed needs-rebase
Merge remote-tracking branch 'origin/main' into aiter-mla-v1
56d22546
remove unnecessary warning
3ee787ec
keep model execution timeout as original value in main branch
f6884187
SageMoore approved these changes on 2025-05-08
DarkLight1337 merged 3c9396a6 into main 1 year ago
Reviewers
houseroad
SageMoore
hongxiayang
tlrmchlsmth
WoosukKwon
robertgshaw2-redhat
njhill
ywang96
comaniac
alexm-redhat
Assignees
No one assigned
Labels
rocm
ready
ci/build
v1
Milestone
No milestone