vllm
67da5720
- [PERF] Speed up Qwen2.5-VL model by speed up rotary position embedding (#17973)
Go
Login via GitHub
Home
Pricing
FAQ
Install
Login
via GitHub
Commit
View On
GitHub
Commit
213 days ago
[PERF] Speed up Qwen2.5-VL model by speed up rotary position embedding (#17973) Signed-off-by: Vadim Gimpelson <vadim.gimpelson@centml.ai>
References
#17973 - [PERF] Speed up Qwen2.5-VL model by speed up rotary position embedding const…
Author
vadiklyutiy
Parents
5c04bb8b
Loading