vllm
a77aea59
- [TPU] support attention head dim smaller than 128 (#19620)
Go
Login via GitHub
Home
Pricing
FAQ
Install
Login
via GitHub
Commit
View On
GitHub
Commit
203 days ago
[TPU] support attention head dim smaller than 128 (#19620) Signed-off-by: Chengji Yao <chengjiyao@google.com> Co-authored-by: mgoin <mgoin64@gmail.com>
References
#19620 - [TPU] support attention head dim smaller than 128
Author
yaochengji
Parents
b692e9cd
Loading