vllm
889da130 - [ Misc ] `fp8-marlin` channelwise via `compressed-tensors` (#6524)

Commit
1 year ago
[ Misc ] `fp8-marlin` channelwise via `compressed-tensors` (#6524) Co-authored-by: mgoin <michael@neuralmagic.com>
Parents
Loading