vllm
a1257fd1 - [Kernel] Add FP8 KV cache support to Triton MLA decode attention (#34597)

Commit
52 days ago
[Kernel] Add FP8 KV cache support to Triton MLA decode attention (#34597) Signed-off-by: grimulkan <grimulkan@gmail.com>
Author
Parents
Loading