vllm
384e4d5f
- [Model Runner V2] Rebuild attention metadata before eagle decode full… (#38311)
Go
Login via GitHub
Home
Pricing
FAQ
Install
Login
via GitHub
Commit
View On
GitHub
Commit
17 days ago
[Model Runner V2] Rebuild attention metadata before eagle decode full… (#38311) Signed-off-by: Giancarlo Delfin <gdelfin@inferact.ai>
References
#38311 - [Model Runner V2] Rebuild attention metadata before eagle decode full…
Author
TheEpicDolphin
Parents
44a65280
Loading