vllm
[ROCm] Enable chunked prefill/paged attention in MLA on ROCm
#14316
Merged


SageMoore committed: init (ae056e14)
vincent-4 requested changes on 2025-03-05
hongxiayang added the rocm label
SageMoore committed: cleanup boolean logic (f1dbffb0)
shajrawi approved these changes on 2025-03-06
houseroad approved these changes on 2025-03-06
LucasWilkinson requested changes on 2025-03-07
SageMoore committed: comments (8f9664db)
hongxiayang approved these changes on 2025-03-10
LucasWilkinson approved these changes on 2025-03-10
LucasWilkinson enabled auto-merge (squash) 279 days ago
github-actions added the ready label
vincent-4 approved these changes on 2025-03-11
SageMoore committed: Merge branch 'main' of https://github.com/neuralmagic/vllm into sage/… (e9673140)
LucasWilkinson merged d9f83d62 into main 277 days ago
SageMoore deleted the sage/amd-deepseek branch 277 days ago
