refine inference step 2 #498
support reminding better backend
ffe91c76
support 3bits on cuda
bbbcccba
fix 3bits asym
5b055654
tiny change
cf2057cb
support loading gptq/awq to autoround format
daf45c7d
wenhuach21
changed the title [WIP]refine inference step 2 refine inference step 2 341 days ago
Update auto_round/autoround.py
8130667d
tiny change
728c2c37
Merge branch 'refine_inference' of https://github.com/intel/auto-roun…
9313c444
n1ck-guo
approved these changes
on 2025-04-09
fix typo
d2f7087b
add more info
db673adb
fix typo
ad361f86
fix typo
1f5542e9
fix some issues
50ba0a2f
fix ut
760f8412
wenhuach21
deleted the refine_inference branch 341 days ago
Assignees
No one assigned
Login to write a write a comment.
Login via GitHub