vllm
[FlashInfer] Revert block_size 16 + head_size 256 workaround on Blackwell
#36987
Merged
Go
Login via GitHub
Home
Pricing
FAQ
Install
Login
via GitHub
Overview
Commits
1
Changes
View On
GitHub
Commits
remove workaround for FI page_size for hybrid models
vadiklyutiy
committed
111 days ago
Loading