llama.cpp
server: add margin for draft model for `fit`
#23485
Merged

server: add margin for draft model for `fit` #23485

am17an
am17an am17an requested a review 36 days ago
am17an am17an requested a review from JohannesGaessler JohannesGaessler 36 days ago
bartowski1182
bartowski1182 commented on 2026-05-21
miloslavnosek
sourenaraya
github-actions github-actions added examples
github-actions github-actions added server
am17an
am17an server: add margin for draft model for `fit`
ac0a2350
am17an am17an force pushed from 0b2f2461 to ac0a2350 35 days ago
am17an clarify worst case memory usage for MTP context
47112993
am17an
JohannesGaessler
JohannesGaessler commented on 2026-05-22
am17an use params.devices if not empty
da51d57b
JohannesGaessler
JohannesGaessler approved these changes on 2026-05-24
ServeurpersoCom
ServeurpersoCom approved these changes on 2026-05-24
am17an am17an merged 83eebe9d into master 33 days ago

Login to write a write a comment.

Login via GitHub

Assignees
No one assigned
Labels
Milestone