Attempt for cleverer auto batch_prefill values (some simplifications). #2808
Attempt for cleverer auto batch_prefill values (some simplifications).
037ea55a
Less flaky tests.
a0003a62
Fixing typo insertion.
5b04d6c4
Update launcher/src/main.rs
36ed43c9
Adding small comment for source of calculation.
d701f9e8
Adding L40.
908dec63
Adding L40s.
14d19738
drbh
approved these changes
on 2024-12-09
Narsil
merged
a04356fb
into main 1 year ago
Narsil
deleted the update_max_prefill_auto_with_vram_reqs branch 1 year ago
Assignees
No one assigned
Login to write a write a comment.
Login via GitHub