transformers
Enable Quantize KV Cache for Mistral Model
#35042
Merged

Enable Quantize KV Cache for Mistral Model #35042

Bojun-Feng
Bojun-Feng fix #35041
89eb601d
zucchini-nlp
zucchini-nlp approved these changes on 2024-12-03
zucchini-nlp
zucchini-nlp zucchini-nlp requested a review from Rocketknight1 Rocketknight1 1 year ago
HuggingFaceDocBuilderDev
Bojun-Feng
zucchini-nlp
zucchini-nlp zucchini-nlp merged 96618960 into main 288 days ago

Login to write a write a comment.

Login via GitHub

Assignees
No one assigned
Labels
Milestone