vllm
87baebeb
- [Frontend][TPU] Add TPU default max-num-batched-tokens based on device name (#17508)
Commit
222 days ago
[Frontend][TPU] Add TPU default max-num-batched-tokens based on device name (#17508)

Signed-off-by: Chenyaaang <chenyangli@google.com>
References
#17508 - [Frontend][TPU] Add TPU default max-num-batched-tokens based on device name
Author
Chenyaaang
Parents
e3d0a1d1