vllm
aa2cd2c4
- [Bugfix] Disable w16a16 2of4 sparse CompressedTensors24 (#12417)
Go
Login via GitHub
Home
Pricing
FAQ
Install
Login
via GitHub
Commit
View On
GitHub
Commit
325 days ago
[Bugfix] Disable w16a16 2of4 sparse CompressedTensors24 (#12417) Signed-off-by: Tyler Michael Smith <tyler@neuralmagic.com> Co-authored-by: mgoin <michael@neuralmagic.com>
References
#12417 - [Bugfix] Disable w16a16 2of4 sparse CompressedTensors24
Author
tlrmchlsmth
Parents
9ddc3522
Loading