vllm
77d2a5f1
- pick up tuned prefill configs for FP8 FA3 (#36265)
Commit
42 days ago
pick up tuned prefill configs for FP8 FA3 (#36265)

Signed-off-by: Jonas M. Kübler <44084297+jmkuebler@users.noreply.github.com>
Signed-off-by: Jonas Kuebler <kuebj@amazon.com>
Author
jmkuebler
Parents
59192dfd