vllm
b95db244 - [v1] Add real sliding window calculation to FlexAttention direct BlockMask building (#26015)

Commit
155 days ago
[v1] Add real sliding window calculation to FlexAttention direct BlockMask building (#26015)

Signed-off-by: Isotr0py <mozf@mail2.sysu.edu.cn>
Signed-off-by: baonudesifeizhai <baonudesifeizhai@gmail.com>
Co-authored-by: baonudesifeizhai <baonudesifeizhai@gmail.com>