text-generation-inference
4e420722 - Hotfix: fix of use of unquantized weights in Mixtral GQA loading (#2269)

Commit
1 year ago
Hotfix: fix of use of unquantized weights in Mixtral GQA loading (#2269)

* Update idefics_causal_lm.py: fix syntax issues
* Fix dbrx & opt model prefix bug
* Hotfix: fix of use of unquantized weights in Mixtral GQA loading