text-generation-inference
Hotfix: fix of use of unquantized weights in Mixtral GQA loading
#2269
Merged

Hotfix: fix of use of unquantized weights in Mixtral GQA loading #2269

danieldk merged 5 commits into huggingface:main from main
icyxp
icyxp Update idefics_causal_lm.py
0c7559b7
icyxp Merge branch 'huggingface:main' into main
28cecb66
icyxp fix dbrx & opt model prefix bug
39944e1c
icyxp Merge branch 'huggingface:main' into main
6111e9ec
icyxp Hotfix: fix of use of unquantized weights in Mixtral GQA loading
dc70cf9d
danieldk
danieldk approved these changes on 2024-07-22
danieldk danieldk merged 4e420722 into main 1 year ago

Login to write a write a comment.

Login via GitHub

Reviewers
Assignees
No one assigned
Labels
Milestone