vllm
[FlashInfer] Revert block_size 16 + head_size 256 workaround on Blackwell
#36987
Merged

Commits
  • remove workaround for FI page_size for hybrid models
    vadiklyutiy committed 111 days ago
Loading