auto-round
fix fp_layers issue and force to FP16 on cuda for autoround format inference
#326
Merged

WeiweiZhang1 merged 9 commits into main from bix_1115
wenhuach21 fix merge error
52ec5615
wenhuach21 fix fp_layers issues
7af3e8ad
wenhuach21 Merge branch 'main' into bix_1115
40a72f2f
wenhuach21 Loosen the restrictions of lm-eval
d44266a7
wenhuach21 fix and add ut
b8331ec6
wenhuach21 fix
c13fc3f0
wenhuach21 API usage does not support fuzzy match
1211ab2d
WeiweiZhang1 Merge branch 'main' into bix_1115
83d53327
WeiweiZhang1 bugfix of UT
523a316e
WeiweiZhang1 approved these changes on 2024-11-19
WeiweiZhang1 merged 459aab03 into main 1 year ago
WeiweiZhang1 deleted the bix_1115 branch 1 year ago
