DeepSpeed
af4356b3 - Fix cuda hardcode for inference woq (#5565)

Comment changes are shownComment changes are hidden
Commit
1 year ago
Fix cuda hardcode for inference woq (#5565) This is a simple fix for inference woq part, changing from `'cuda'` to `get_accelerator().device_name()`. --------- Co-authored-by: Logan Adams <114770087+loadams@users.noreply.github.com>
Author
Parents
  • deepspeed/inference/quantization
    • File
      utils.py