text-generation-inference
Attempt for cleverer auto batch_prefill values (some simplifications).
#2808
Merged

Attempt for cleverer auto batch_prefill values (some simplifications). #2808

Narsil
Narsil Attempt for cleverer auto batch_prefill values (some simplifications).
037ea55a
Narsil Less flaky tests.
a0003a62
Narsil Fixing typo insertion.
5b04d6c4
Narsil Narsil requested a review from danieldk danieldk 1 year ago
Narsil Narsil requested a review from drbh drbh 1 year ago
danieldk
danieldk commented on 2024-12-09
Narsil Update launcher/src/main.rs
36ed43c9
Narsil Adding small comment for source of calculation.
d701f9e8
Narsil Adding L40.
908dec63
Narsil Adding L40s.
14d19738
drbh
drbh approved these changes on 2024-12-09
Narsil Narsil merged a04356fb into main 1 year ago
Narsil Narsil deleted the update_max_prefill_auto_with_vram_reqs branch 1 year ago

Login to write a write a comment.

Login via GitHub

Reviewers
Assignees
No one assigned
Labels
Milestone