text-generation-inference
Add support for exl2-quantized models
#1965
Merged

Commits
  • Add support for exl2 quantization
    danieldk committed 1 year ago
  • Gemma GPTQ checks: skip logprob checks
    danieldk committed 1 year ago