xla
2edcd2e3
- [Kernel] support kv cache quantization in ragged attention kernel (#9249)
Go
Login via GitHub
Home
Pricing
FAQ
Install
Login
via GitHub
Commit
View On
GitHub
Commit
192 days ago
[Kernel] support kv cache quantization in ragged attention kernel (#9249)
References
#9249 - [Kernel] support kv cache quantization in ragged attention kernel
Author
yaochengji
Parents
248a8b33
Loading