text-generation-inference
f1404400
- fp8 compressed tensors w8a8 support for Gaudi backend (#3242)
Commit
206 days ago
fp8 compressed tensors w8a8 support for Gaudi backend (#3242)

Signed-off-by: Wang, Yi A <yi.a.wang@intel.com>
References
#3242 - fp8 compressed_tensors w8a8 support
Author
sywangyi
Parents
1883a62a