text-generation-inference
86422506 - fix of use of unquantized weights in cohere GQA loading, also enable … (#2291)

Commit

1 year ago

fix of use of unquantized weights in cohere GQA loading, also enable … (#2291) fix of use of unquantized weights in cohere GQA loading, also enable the model in intel platform Signed-off-by: Wang, Yi A <yi.a.wang@intel.com>

References

#2291 - fix of use of unquantized weights in cohere GQA loading, also enable …

Author

sywangyi

Parents

5ad39dd3

text-generation-inference 86422506 - fix of use of unquantized weights in cohere GQA loading, also enable … (#2291)

text-generation-inference
86422506 - fix of use of unquantized weights in cohere GQA loading, also enable … (#2291)