vllm
8374387b
- [FlashInfer] Revert block_size 16 + head_size 256 workaround on Blackwell (#36987)
Go
Login via GitHub
Home
Pricing
FAQ
Install
Login
via GitHub
Commit
View On
GitHub
Commit
29 days ago
[FlashInfer] Revert block_size 16 + head_size 256 workaround on Blackwell (#36987) Signed-off-by: Vadim Gimpelson <vadim.gimpelson@gmail.com>
References
#36987 - [FlashInfer] Revert block_size 16 + head_size 256 workaround on Blackwell
Author
vadiklyutiy
Parents
912fbe95
Loading