optimum
aba7f46d - add_exllamav2 (#1419)

Commit
1 year ago
add_exllamav2 (#1419) * add_exllamav2 * style * fix doc * fix doc * raise error * Update docs/source/llm_quantization/usage_guides/quantization.mdx Co-authored-by: fxmarty <9808326+fxmarty@users.noreply.github.com> * update doc * update min version of autogptq --------- Co-authored-by: fxmarty <9808326+fxmarty@users.noreply.github.com>
Author
Parents
  • docs/source/llm_quantization/usage_guides
    • File
      quantization.mdx
  • optimum
    • gptq
      • File
        quantizer.py
    • utils
      • File
        import_utils.py
  • tests/gptq
    • File
      test_quantization.py