vllm
7ff7a638
- [Model][Quant] Fix GLM, Fix fused module mappings for quantization (#12634)
Go
Login via GitHub
Home
Pricing
FAQ
Install
Login
via GitHub
Commit
View On
GitHub
Commit
308 days ago
[Model][Quant] Fix GLM, Fix fused module mappings for quantization (#12634) Signed-off-by: mgoin <michael@neuralmagic.com> Signed-off-by: Kyle Sayers <kylesayrs@gmail.com> Co-authored-by: mgoin <michael@neuralmagic.com>
References
#12634 - [Model][Quant] Fix GLM, Fix fused module mappings for quantization
Author
kylesayrs
Parents
686006a2
Loading