text-generation-inference
4e420722
- Hotfix: fix of use of unquantized weights in Mixtral GQA loading (#2269)
Commit
1 year ago
Hotfix: fix of use of unquantized weights in Mixtral GQA loading (#2269)

* Update idefics_causal_lm.py
  Fix syntax issues
* fix dbrx & opt model prefix bug
* Hotfix: fix of use of unquantized weights in Mixtral GQA loading
References
#2269 - Hotfix: fix of use of unquantized weights in Mixtral GQA loading
Author
icyxp
Parents
f3435bab