[PD][Nixl] Add support for hybrid SSM-FA models #36687
NickLucche
marked this pull request as ready for review 87 days ago
hacking ssm
e46626af
fix for dsv32
effaf1c3
integration tests
a8e38b1f
unit tests logical blocks
02189563
precommit
2cdc16ee
cleanup prints
700ec310
trust-remote-code
144a4d50
drop cruft
6aa0ae65
cruft
1b3988e2
toms review
ca1806fd
block len per layer cleanup
723ab1c4
remote num_blocks fix in get_block_ids
aeda1a53
hop over remote conv size for ssm
e89dcb26
generalize FA num_blocks swap
fa91b3e4
precommit
15065f5d
fix page_size for flashattn
25ade8c8
fix cross-layer page_size counting
99c4a218
update tests to have matching kv cache config
38310924
fix heterogeneous TP
e80bb2e8
fix cross_layer blocks layout
bd365b15
async scheduling issue
900225b0
fix test
208d6c80
Assignees
No one assigned
Labels
ready
v1
kv-connector
Login to write a write a comment.
Login via GitHub