vllm
3fb0d909 - [AMD] Use Decoupled Kernel Block Size to Support AITER MLA block_size=1 (#27715)

Commit
21 days ago
[AMD] Use Decoupled Kernel Block Size to Support AITER MLA block_size=1 (#27715)

Signed-off-by: chiangzhang <chiangzhang@tencent.com>