text-generation-inference
57b34958 - Fixing exl2 and other quantize tests again. (#2419)

Commit
1 year ago
Fixing exl2 and other quantize tests again. (#2419)

* Fixing exl2 and other quantize tests again.
* Mark exl2 as non release (so CI tests them, needs to be removed later).
* Fixing exl2 (by disabling cuda graphs)
* Fix quantization defaults without cuda graphs on exl2 (linked to new issues with it).
* Removing serde override.
* Go back to released exl2 and remove log.
* Adding warnings for deprecated bitsandbytes + upgrade info to warn.