vllm
d7fbc6dd - [Misc] Enable V1 FP16 inference on pre-Ampere GPUs (#24022)

Commit
119 days ago
[Misc] Enable V1 FP16 inference on pre-Ampere GPUs (#24022) Signed-off-by: Isotr0py <mozf@mail2.sysu.edu.cn>
Author
Parents
Loading