vllm
5e6f9394 - [Attention] MLA move rotary embedding to cuda-graph region (#17668)

Commit
228 days ago
[Attention] MLA move rotary embedding to cuda-graph region (#17668)

Signed-off-by: Lucas Wilkinson <lwilkinson@neuralmagic.com>