vllm
0e237f00 - [FEAT][ROCm] Integrate Paged Attention Kernel from AITER (#15001)

Commit
234 days ago
[FEAT][ROCm] Integrate Paged Attention Kernel from AITER (#15001)

Signed-off-by: vllmellm <vllm.ellm@embeddedllm.com>
Signed-off-by: tjtanaa <tunjian.tan@embeddedllm.com>
Co-authored-by: tjtanaa <tunjian.tan@embeddedllm.com>