vllm
01a583fe - [Kernel] Decouple Tile Size from Block Size in Triton Unified Attention Kernel (#21197)

Commit
81 days ago
[Kernel] Decouple Tile Size from Block Size in Triton Unified Attention Kernel (#21197) Signed-off-by: Jan van Lunteren <jvl@zurich.ibm.com>
Author
Parents
Loading