vllm
Add nvfp4 support to reshape_and_cache_flash
#37332
Merged

vllm-bot merged 9 commits into vllm-project:main from sychen52:nvfp4_kv
sychen52
sychen52 requested a review from mgoin 103 days ago
sychen52 requested a review from tlrmchlsmth 103 days ago
sychen52 requested a review from WoosukKwon 103 days ago
sychen52 requested a review from yewentao256 103 days ago
sychen52 requested a review from njhill 103 days ago
sychen52 requested a review from heheda12345 103 days ago
sychen52 requested a review from pavanimajety 103 days ago
sychen52 requested a review from zhuohan123 103 days ago
sychen52 requested a review from youkaichao 103 days ago
sychen52 requested a review from alexm-redhat 103 days ago
sychen52 requested a review from LucasWilkinson 103 days ago
sychen52 requested a review from MatthewBonanni 103 days ago
gemini-code-assist
gemini-code-assist commented on 2026-03-17
sychen52 changed the title from "add nvfp4 support to reshape_and_cache_flash" to "Add nvfp4 support to reshape_and_cache_flash" 103 days ago
sychen52 force pushed from e6088718 103 days ago
sychen52 force pushed 103 days ago
mergify
mergify added documentation
mergify added ci/build
mergify added nvidia
mergify added v1
sychen52 force pushed 103 days ago
pavanimajety
pavanimajety commented on 2026-03-18
mergify
mergify added needs-rebase
sychen52 force pushed 96 days ago
sychen52 requested a review from robertgshaw2-redhat 96 days ago
mergify removed needs-rebase
sychen52 force pushed 96 days ago
sychen52 requested a review from pavanimajety 96 days ago
sychen52 force pushed 95 days ago
pavanimajety
pavanimajety
pavanimajety commented on 2026-03-30
sychen52 force pushed 90 days ago
sychen52 requested a review from vadiklyutiy 90 days ago
mergify
mergify added needs-rebase
sychen52 force pushed 89 days ago
mergify removed needs-rebase
sychen52 force pushed 89 days ago
mergify
mergify added needs-rebase
sychen52 force pushed 88 days ago
mergify removed needs-rebase
mergify
mergify added needs-rebase
sychen52 force pushed 87 days ago
mergify removed needs-rebase
sychen52 force pushed to 80521c44 87 days ago
pavanimajety added verified
mergify
sychen52 force pushed 87 days ago
mergify
pavanimajety added ready
sychen52 force pushed 86 days ago
sychen52 force pushed 86 days ago
Edwardf0t1
pavanimajety
LucasWilkinson
LucasWilkinson requested changes on 2026-04-10
sychen52 requested a review from LucasWilkinson 79 days ago
sychen52 force pushed 79 days ago
mergify
mergify added needs-rebase
sychen52 force pushed 79 days ago
mergify removed needs-rebase
sychen52 force pushed 79 days ago
vadiklyutiy
vadiklyutiy commented on 2026-04-10
pavanimajety
pavanimajety approved these changes on 2026-04-14
sychen52 force pushed 75 days ago
sychen52 force pushed 75 days ago
sychen52 force pushed 74 days ago
sychen52 force pushed 74 days ago
sychen52 force pushed 73 days ago
pavanimajety enabled auto-merge (squash) 73 days ago
pavanimajety
mgoin
mgoin approved these changes on 2026-04-16
sychen52 add nvfp4 support to reshape_and_cache_flash
d116847b
sychen52 change the nvfp4 kv cache layout
f3347eac
sychen52 changes based on comments
d5fcfe7e
sychen52 fix libtorch_stable change
a33ebbdf
sychen52 only swizzle on v block scale; remove cache_dtype_str after rebase
55c552da
sychen52 pre-commit
3d526f2a
sychen52 fixed unittest on non supported GPU
c8968868
sychen52 do not show nvfp4 support for now
befb6467
disabled auto-merge 73 days ago
Head branch was pushed to by a user without write access
sychen52 force pushed to befb6467 73 days ago
sychen52
pavanimajety
pavanimajety commented on 2026-04-16
sychen52 add NonImplementedError
54c34587
pavanimajety enabled auto-merge (squash) 73 days ago
vllm-bot merged 6b2b7bd0 into main 72 days ago

Assignees
No one assigned