text-generation-inference
f1404400 - fp8 compressed tensors w8a8 support for Gaudi backend (#3242)

Commit
206 days ago
fp8 compressed tensors w8a8 support for Gaudi backend (#3242)

Signed-off-by: Wang, Yi A <yi.a.wang@intel.com>