vllm
Fix INT8 quantization error on Blackwell GPUs (SM100+)
#25935
Merged

certainly-param
certainly-param requested a review from mgoin 252 days ago
certainly-param requested a review from robertgshaw2-redhat 252 days ago
certainly-param requested a review from tlrmchlsmth 252 days ago
certainly-param requested a review from yewentao256 252 days ago
github-actions
mergify added documentation
gemini-code-assist
gemini-code-assist commented on 2025-09-30
certainly-param Add INT8 check for Blackwell GPUs
930c65bb
certainly-param Use capability.to_int() for consistency
7bfb5724
certainly-param Fix pre-commit issues
daaec73b
certainly-param Remove invalid weight_dtype check
151ea664
certainly-param Apply clang-format to C++ code
dd0a7b00
certainly-param force pushed to dd0a7b00 252 days ago
yewentao256
yewentao256 commented on 2025-09-30
certainly-param
yewentao256
certainly-param force pushed to dd0a7b00 252 days ago
certainly-param
yewentao256
yewentao256 approved these changes on 2025-09-30
yewentao256 added ready
mgoin
mgoin approved these changes on 2025-09-30
mgoin enabled auto-merge (squash) 252 days ago
vllm-bot merged 99028fda into main 251 days ago
