vllm
a1257fd1
- [Kernel] Add FP8 KV cache support to Triton MLA decode attention (#34597)
Commit
52 days ago
[Kernel] Add FP8 KV cache support to Triton MLA decode attention (#34597) Signed-off-by: grimulkan <grimulkan@gmail.com>
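The commit adds FP8 KV cache support to the Triton MLA decode kernel. The general idea behind a low-precision KV cache is to store keys and values in 8 bits alongside a scale factor and dequantize them on the fly inside the attention kernel. A minimal NumPy sketch of that quantize/dequantize pattern, assuming per-tensor scaling and using int8 as a stand-in since NumPy has no FP8 dtype (all names here are illustrative, not from the vLLM kernel):

```python
import numpy as np

def quantize_kv(x: np.ndarray, qmax: float = 127.0):
    """Per-tensor scale quantization (int8 stand-in for FP8 e4m3)."""
    scale = float(np.abs(x).max()) / qmax if x.size else 1.0
    q = np.clip(np.round(x / scale), -qmax, qmax).astype(np.int8)
    return q, scale

def dequantize_kv(q: np.ndarray, scale: float) -> np.ndarray:
    """Recover approximate full-precision values before the attention dot product."""
    return q.astype(np.float32) * scale

# Simulate caching a small key block and reading it back.
np.random.seed(0)
k = np.random.randn(4, 8).astype(np.float32)
q, s = quantize_kv(k)
k_hat = dequantize_kv(q, s)
err = float(np.abs(k - k_hat).max())
```

The stored cache halves memory versus FP16 at the cost of the quantization error `err`; a real FP8 kernel dequantizes inside the Triton program so the dot product still runs in higher precision.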
References
#34597 - [Kernel] Add FP8 KV cache support to Triton MLA decode attention
Author: grimulkan
Parents: abcffbba