text-generation-inference
5f78ec32 - Do not convert weight scale to e4m3fnuz on CUDA (#2917)

Commit
1 year ago