text-generation-inference
4844ff79 - fix(server): fix fp8 weight loading (#2268)

Commit
1 year ago
fix(server): fix fp8 weight loading (#2268) * fix(server): fix fp8 weight loading * fixed scales loading * update snap * revert default dtype
Parents
Loading