text-generation-inference
fp8 compressed_tensors w8a8 support
#3242
Merged

fp8 compressed_tensors w8a8 support #3242

regisss merged 4 commits into huggingface:main from sywangyi:fp8_compressor
sywangyi
sywangyi fp8 compressed_tensors w8a8 support
4ffa111f
sywangyi
sywangyi
sywangyi Merge branch 'main' into fp8_compressor
a2934644
sywangyi remove print
ce8978f9
regisss
regisss dismissed these changes on 2025-05-26
regisss
sywangyi add multi-weight for GPTQ weight loader
475f6e21
sywangyi sywangyi dismissed their stale review via 475f6e21 209 days ago
Narsil
Narsil approved these changes on 2025-05-28
regisss regisss merged f1404400 into main 208 days ago

Login to write a write a comment.

Login via GitHub

Reviewers
Assignees
No one assigned
Labels
Milestone