vllm
5e6f9394
- [Attention] MLA move rotary embedding to cuda-graph region (#17668)
Go
Login via GitHub
Home
Pricing
FAQ
Install
Login
via GitHub
Commit
View On
GitHub
Commit
228 days ago
[Attention] MLA move rotary embedding to cuda-graph region (#17668) Signed-off-by: Lucas Wilkinson <lwilkinson@neuralmagic.com>
References
#17668 - [Attention] MLA move rotary embedding to cuda-graph region
Author
LucasWilkinson
Parents
760e3ecc
Loading