vllm
3fb0d909
- [AMD] Use Decoupled Kernel Block Size to Support AITER MLA block_size=1 (#27715)
Go
Login via GitHub
Home
Pricing
FAQ
Install
Login
via GitHub
Commit
View On
GitHub
Commit
21 days ago
[AMD] Use Decoupled Kernel Block Size to Support AITER MLA block_size=1 (#27715) Signed-off-by: chiangzhang <chiangzhang@tencent.com>
References
#27715 - [AMD] Use Decoupled Kernel Block Size to Support AITER MLA block_size=1
Author
zq1997
Parents
05c2dee7
Loading