auto-round
speedup packing stage for autogptq and autoawq format
#407
Merged

speedup packing stage for autogptq and autoawq format #407

wenhuach21 merged 17 commits into main from cuda_packing
wenhuach21
wenhuach21 use cuda for packing
7a8f48ec
wenhuach21 Merge branch 'main' into cuda_packing
fe1e08f7
wenhuach21 update
8f7b9b6a
wenhuach21 wenhuach21 marked this pull request as draft 349 days ago
wenhuach21 update
66a4c3bc
wenhuach21 update auto-round triton
ab3d93c1
wenhuach21 speed up autoawq packing
ec04b597
wenhuach21 Merge branch 'cuda_packing' of https://github.com/intel/auto-round in…
7f090745
wenhuach21 wenhuach21 changed the title use cuda for packing speedup packing stage for autogptq and autoawq format 349 days ago
wenhuach21 Merge branch 'main' into cuda_packing
a71f60b7
wenhuach21 wenhuach21 marked this pull request as ready for review 349 days ago
wenhuach21 wenhuach21 requested a review from WeiweiZhang1 WeiweiZhang1 349 days ago
wenhuach21 wenhuach21 requested a review from n1ck-guo n1ck-guo 349 days ago
wenhuach21 trigger test
0bc89ab0
wenhuach21 Merge branch 'cuda_packing' of https://github.com/intel/auto-round in…
121c337c
wenhuach21 fix line too long issue
fceea4f0
wenhuach21 fix bug
a52a88a7
wenhuach21 fix bug
6f309bc4
wenhuach21 try to fix ut
e45a73f7
wenhuach21 Merge branch 'main' into cuda_packing
5af95930
n1ck-guo
n1ck-guo approved these changes on 2025-01-15
WeiweiZhang1
WeiweiZhang1 approved these changes on 2025-01-15
wenhuach21 update README.md
f41aea01
wenhuach21 Merge branch 'cuda_packing' of https://github.com/intel/auto-round in…
5db79385
wenhuach21 wenhuach21 merged 937d0198 into main 347 days ago
wenhuach21 wenhuach21 deleted the cuda_packing branch 347 days ago

Login to write a write a comment.

Login via GitHub

Assignees
No one assigned
Labels
Milestone