xla
bea86eec
- Update ragged paged attention kernel to prevent vmem oom (#9346)
Go
Login via GitHub
Home
Pricing
FAQ
Install
Login
via GitHub
Commit
View On
GitHub
Commit
142 days ago
Update ragged paged attention kernel to prevent vmem oom (#9346) Signed-off-by: Chenyaaang <chenyangli@google.com>
References
#9346 - Update ragged paged attention kernel to prevent vmem oom
Author
Chenyaaang
Parents
3a1ed628
Loading