vllm
d7fbc6dd
- [Misc] Enable V1 FP16 inference on pre-Ampere GPUs (#24022)
Go
Login via GitHub
Home
Pricing
FAQ
Install
Login
via GitHub
Commit
View On
GitHub
Commit
119 days ago
[Misc] Enable V1 FP16 inference on pre-Ampere GPUs (#24022) Signed-off-by: Isotr0py <mozf@mail2.sysu.edu.cn>
References
#24022 - [Misc] Enable V1 FP16 inference on pre-Ampere GPUs
Author
Isotr0py
Parents
5438967f
Loading