vllm
[Misc] Enable V1 FP16 inference on pre-Ampere GPUs
#24022
Merged

[Misc] Enable V1 FP16 inference on pre-Ampere GPUs #24022

Isotr0py
Isotr0py fully enable v1 on turing
6b3fbfb8
Isotr0py Isotr0py requested a review from DarkLight1337 DarkLight1337 120 days ago
gemini-code-assist
gemini-code-assist commented on 2025-09-01
DarkLight1337
DarkLight1337 approved these changes on 2025-09-01
DarkLight1337 DarkLight1337 enabled auto-merge (squash) 120 days ago
github-actions github-actions added ready
jeejeelee
jeejeelee approved these changes on 2025-09-01
DarkLight1337 DarkLight1337 merged d7fbc6dd into main 120 days ago
Isotr0py Isotr0py deleted the v1-turing branch 120 days ago
Jyothirmaikottu

Login to write a write a comment.

Login via GitHub

Assignees
No one assigned
Labels
Milestone