speedup packing stage for autogptq and autoawq format #407
use cuda for packing
7a8f48ec
Merge branch 'main' into cuda_packing
fe1e08f7
update
8f7b9b6a
wenhuach21
marked this pull request as draft 349 days ago
update
66a4c3bc
update auto-round triton
ab3d93c1
speed up autoawq packing
ec04b597
Merge branch 'cuda_packing' of https://github.com/intel/auto-round in…
7f090745
wenhuach21
changed the title from "use cuda for packing" to "speedup packing stage for autogptq and autoawq format" 349 days ago
Merge branch 'main' into cuda_packing
a71f60b7
wenhuach21
marked this pull request as ready for review 349 days ago
trigger test
0bc89ab0
Merge branch 'cuda_packing' of https://github.com/intel/auto-round in…
121c337c
fix line too long issue
fceea4f0
fix bug
a52a88a7
fix bug
6f309bc4
try to fix ut
e45a73f7
Merge branch 'main' into cuda_packing
5af95930
n1ck-guo
approved these changes
on 2025-01-15
update README.md
f41aea01
Merge branch 'cuda_packing' of https://github.com/intel/auto-round in…
5db79385
wenhuach21
deleted the cuda_packing branch 347 days ago
Assignees
No one assigned