auto-round
fix gguf fp8 input model and vram issue
#844
Merged

fix gguf fp8 input model and vram issue #844

wenhuach21 merged 9 commits into main from fix_gguf_fp8
wenhuach21
wenhuach21 fix gguf fp8 input model issue
3f3c6c16
n1ck-guo
n1ck-guo approved these changes on 2025-09-23
wenhuach21 for debug, still has some oom issue
449b0ed1
pre-commit-ci[bot] [pre-commit.ci] auto fixes from pre-commit.com hooks
edb8c24c
wenhuach21 Merge branch 'main' into fix_gguf_fp8
293ec791
wenhuach21 refine a little
b30029e7
n1ck-guo fix gguf
f6eec55a
n1ck-guo Merge branch 'main' into fix_gguf_fp8
f8aa6113
n1ck-guo clean
5c3917ca
wenhuach21 rm value error
322e8f4f
wenhuach21 wenhuach21 changed the title fix gguf fp8 input model issue fix gguf fp8 input model and vram issue 95 days ago
wenhuach21 wenhuach21 merged 4562cf47 into main 95 days ago
wenhuach21 wenhuach21 deleted the fix_gguf_fp8 branch 95 days ago

Login to write a write a comment.

Login via GitHub

Reviewers
Assignees
No one assigned
Labels
Milestone