vllm
[FlashInfer] Revert block_size 16 + head_size 256 workaround on Blackwell
#36987
Merged

[FlashInfer] Revert block_size 16 + head_size 256 workaround on Blackwell #36987

vadiklyutiy
vadiklyutiy remove workaround for FI page_size for hybrid models
ee69b5a4
vadiklyutiy vadiklyutiy requested a review from mgoin mgoin 107 days ago
vadiklyutiy vadiklyutiy requested a review from pavanimajety pavanimajety 107 days ago
vadiklyutiy vadiklyutiy removed review request from mgoin mgoin 107 days ago
vadiklyutiy vadiklyutiy removed review request from pavanimajety pavanimajety 107 days ago
vadiklyutiy vadiklyutiy requested a review from heheda12345 heheda12345 107 days ago
vadiklyutiy vadiklyutiy requested a review from pavanimajety pavanimajety 107 days ago
mergify mergify added nvidia
mergify mergify added v1
vadiklyutiy vadiklyutiy requested a review from mgoin mgoin 107 days ago
gemini-code-assist
gemini-code-assist commented on 2026-03-13
vadiklyutiy vadiklyutiy added qwen
vadiklyutiy vadiklyutiy assigned vadiklyutiy vadiklyutiy 107 days ago
vadiklyutiy vadiklyutiy added ready
hmellor
vadiklyutiy
hmellor
hmellor
hmellor approved these changes on 2026-03-16
hmellor hmellor merged 8374387b into main 104 days ago
vadiklyutiy
hmellor

Login to write a write a comment.

Login via GitHub

Assignees
Labels
Milestone