text-generation-inference
86422506
- fix of use of unquantized weights in cohere GQA loading, also enable … (#2291)
Go
Login via GitHub
Home
Pricing
FAQ
Install
Login
via GitHub
Commit
View On
GitHub
Commit
1 year ago
fix of use of unquantized weights in cohere GQA loading, also enable … (#2291) fix of use of unquantized weights in cohere GQA loading, also enable the model in intel platform Signed-off-by: Wang, Yi A <yi.a.wang@intel.com>
References
#2291 - fix of use of unquantized weights in cohere GQA loading, also enable …
Author
sywangyi
Parents
5ad39dd3
Loading