vllm
[Bugfix][Attention] Explicitly report support for kv_cache_dtype bfloat16
#32795
Merged

[Bugfix][Attention] Explicitly report support for kv_cache_dtype bfloat16 #32795

MatthewBonanni
MatthewBonanni Add bfloat16 explicitly
6e06067b
MatthewBonanni MatthewBonanni requested a review from pavanimajety pavanimajety 33 days ago
MatthewBonanni MatthewBonanni changed the title [Attention] Explicitly report support for kv_cache_dtype bfloat16 [Bugfix][Attention] Explicitly report support for kv_cache_dtype bfloat16 33 days ago
mergify mergify added nvidia
mergify mergify added v1
mergify mergify added bug
gemini-code-assist
gemini-code-assist commented on 2026-01-21
MatthewBonanni Update regular attention backends and base class
077fa118
MatthewBonanni MatthewBonanni requested a review from tdoublep tdoublep 33 days ago
MatthewBonanni MatthewBonanni requested a review from mgoin mgoin 33 days ago
MatthewBonanni MatthewBonanni requested a review from WoosukKwon WoosukKwon 33 days ago
MatthewBonanni MatthewBonanni requested a review from zhuohan123 zhuohan123 33 days ago
MatthewBonanni MatthewBonanni requested a review from youkaichao youkaichao 33 days ago
MatthewBonanni MatthewBonanni requested a review from alexm-redhat alexm-redhat 33 days ago
MatthewBonanni MatthewBonanni requested a review from njhill njhill 33 days ago
MatthewBonanni MatthewBonanni requested a review from LucasWilkinson LucasWilkinson 33 days ago
MatthewBonanni MatthewBonanni marked this pull request as draft 33 days ago
MatthewBonanni Fix references to auto
dae74fe0
mergify mergify added cpu
MatthewBonanni MatthewBonanni marked this pull request as ready for review 33 days ago
MatthewBonanni MatthewBonanni requested a review from bigPYJ1151 bigPYJ1151 33 days ago
MatthewBonanni MatthewBonanni requested a review from robertgshaw2-redhat robertgshaw2-redhat 33 days ago
MatthewBonanni MatthewBonanni requested a review from tlrmchlsmth tlrmchlsmth 33 days ago
MatthewBonanni MatthewBonanni requested a review from yewentao256 yewentao256 33 days ago
ProExpertProg
ProExpertProg approved these changes on 2026-01-22
MatthewBonanni Use is_quantized_kv_cache
dc7fce14
tlrmchlsmth
tlrmchlsmth approved these changes on 2026-01-22
ProExpertProg ProExpertProg enabled auto-merge (squash) 32 days ago
github-actions github-actions added ready
ProExpertProg Merge branch 'main' into attn_bf16
4bfed7d8
ProExpertProg ProExpertProg merged 955b43a5 into main 32 days ago
MatthewBonanni MatthewBonanni deleted the attn_bf16 branch 32 days ago

Login to write a write a comment.

Login via GitHub