vllm
a8c53682
- Consolidate Nvidia ModelOpt quant config handling for all quantization methods (#28076)
Go
Login via GitHub
Home
Pricing
FAQ
Install
Login
via GitHub
Commit
View On
GitHub
Commit
98 days ago
Consolidate Nvidia ModelOpt quant config handling for all quantization methods (#28076) Signed-off-by: Shengliang Xu <shengliangx@nvidia.com>
References
#28076 - Consolidate Nvidia ModelOpt quant config handling for all quantization methods
Author
shengliangxu
Parents
fcbcba6c
Loading