text-generation-inference
86422506 - fix of use of unquantized weights in cohere GQA loading, also enable … (#2291)

Commit
1 year ago
fix of use of unquantized weights in cohere GQA loading, also enable the model in intel platform

Signed-off-by: Wang, Yi A <yi.a.wang@intel.com>