vllm
e7523c2e
- [V1][Sampler] Improve performance of FlashInfer sampling by sampling logits instead of probs (#18608)
Go
Login via GitHub
Home
Pricing
FAQ
Install
Login
via GitHub
Commit
View On
GitHub
Commit
203 days ago
[V1][Sampler] Improve performance of FlashInfer sampling by sampling logits instead of probs (#18608)
References
#18608 - [V1][Sampler] Improve performance of FlashInfer sampling by sampling logits instead of probs
Author
lgeiger
Parents
a869baca
Loading