vllm
868c546d
- Support W8A8 INT8 MoE for compressed-tensors (#16745)
Go
Login via GitHub
Home
Pricing
FAQ
Install
Login
via GitHub
Commit
View On
GitHub
Commit
224 days ago
Support W8A8 INT8 MoE for compressed-tensors (#16745) Signed-off-by: mgoin <mgoin64@gmail.com>
References
#16745 - Support W8A8 INT8 MoE for compressed-tensors
Author
mgoin
Parents
99404f53
Loading