DeepSpeed
af4356b3
- Fix cuda hardcode for inference woq (#5565)
Go
Login via GitHub
Home
Pricing
FAQ
Install
Login
via GitHub
Commit
View On
GitHub
Hide Minimap (CTRL+M)
Commit
1 year ago
Fix cuda hardcode for inference woq (#5565) This is a simple fix for inference woq part, changing from `'cuda'` to `get_accelerator().device_name()`. --------- Co-authored-by: Logan Adams <114770087+loadams@users.noreply.github.com>
References
#5565 - Fix cuda hardcode for inference woq
Author
Liangliang-Ma
Parents
f4cb866c
Files
1
deepspeed/inference/quantization
utils.py
Loading