vllm
f3768686
- Quick fix for IMA with the Prefix Prefill kernel during graph capture (#25983)
Go
Login via GitHub
Home
Pricing
FAQ
Install
Login
via GitHub
Commit
View On
GitHub
Commit
196 days ago
Quick fix for IMA with the Prefix Prefill kernel during graph capture (#25983) Signed-off-by: Sage Moore <sage@neuralmagic.com> Signed-off-by: yewentao256 <zhyanwentao@126.com>
References
#25293 - [Refactor] Refactor FP8 & INT8 Quant Folder inside `w8a8`
Author
SageMoore
Committer
yewentao256
Parents
564233d5
Loading