auto-round
fix gguf regression for large models
#680
Merged

fix gguf regression for large models #680

wenhuach21 merged 4 commits into main from update_0722
wenhuach21
wenhuach21 trigger ut
e0396b9a
wenhuach21 try to fix block_wise rtn issue
5f606646
wenhuach21 wenhuach21 changed the title trigger ut fix block_wise rtn issue 163 days ago
n1ck-guo
n1ck-guo approved these changes on 2025-07-22
wenhuach21 refine
11949823
wenhuach21 wenhuach21 changed the title fix block_wise rtn issue fix gguf regression for large models 163 days ago
wenhuach21 wenhuach21 requested a review from copilot-pull-request-reviewer copilot-pull-request-reviewer 163 days ago
copilot-pull-request-reviewer
copilot-pull-request-reviewer commented on 2025-07-22
wenhuach21 clear cuda memory
ac00585c
wenhuach21 wenhuach21 merged 21bb06b6 into main 163 days ago
wenhuach21 wenhuach21 deleted the update_0722 branch 163 days ago

Login to write a write a comment.

Login via GitHub

Assignees
No one assigned
Labels
Milestone