vllm
ccd7c050
- [Kernel] Add Split-KV Support to Unified Triton Attention Kernel (#19152)
Commit
201 days ago
[Kernel] Add Split-KV Support to Unified Triton Attention Kernel (#19152)

Signed-off-by: Jan van Lunteren <jvl@zurich.ibm.com>
References
#19152 - [Kernel] Add Split-KV Support to Unified Triton Attention Kernel
Author
jvlunteren
Parents
c48c6c40