vllm
9a2a6357
- [Bugfix] Fix FP8 Marlin MoE and enable for compressed-tensors models (#18026)
Go
Login via GitHub
Home
Pricing
FAQ
Install
Login
via GitHub
Commit
View On
GitHub
Commit
269 days ago
[Bugfix] Fix FP8 Marlin MoE and enable for compressed-tensors models (#18026) Signed-off-by: mgoin <mgoin64@gmail.com>
References
#18026 - [Bugfix] Fix FP8 Marlin MoE and enable for compressed-tensors models
Author
mgoin
Parents
6266c57b
Loading