vllm
[V1][P/D] Local attention optimization for NIXL
#18170
Merged

[V1][P/D] Local attention optimization for NIXL #18170

mgoin merged 7 commits into vllm-project:main from neuralmagic:nixl-l4-opt
mgoin
mgoin Local attention optimization for NIXL
7e55a344
github-actions
mgoin Clean up a lot!
8ea467d3
mgoin Small opt
73a8272a
mgoin Fix mypy
17cc4c99
mgoin mgoin marked this pull request as ready for review 1 year ago
mgoin mgoin added v1
mgoin mgoin changed the title [WIP] Local attention optimization for NIXL [V1][P/D] Local attention optimization for NIXL 1 year ago
WoosukKwon WoosukKwon requested a review from heheda12345 heheda12345 1 year ago
WoosukKwon
heheda12345
mgoin
LucasWilkinson
LucasWilkinson commented on 2025-05-15
WoosukKwon
mgoin Add TODO to remove
af2f2642
mgoin Merge branch 'main' into nixl-l4-opt
e8fd2f1d
mgoin Bug fixes
7f0ef82c
WoosukKwon WoosukKwon added ready
tlrmchlsmth
tlrmchlsmth approved these changes on 2025-05-16
WoosukKwon
WoosukKwon approved these changes on 2025-05-16
mgoin mgoin merged fd195b19 into main 1 year ago

Login to write a write a comment.

Login via GitHub

Assignees
No one assigned
Labels
Milestone