Support direct quantization for FP8 matmul #3922
wenscarl
changed the title Support direct quantization for FP8 matmul [draft]Support direct quantization for FP8 matmul 1 year ago
kaixih
commented
on 2024-05-14
kaixih
commented
on 2024-05-23
kaixih
commented
on 2024-05-29
wenscarl
changed the title [draft]Support direct quantization for FP8 matmul Support direct quantization for FP8 matmul 1 year ago
kaixih
commented
on 2024-05-30
kaixih
commented
on 2024-08-22
kaixih
approved these changes
on 2024-08-30
Direct quantization for FP8 Dense Layer.
6f5cee12
wenscarl
force pushed
to
6f5cee12
1 year ago
kaixih
approved these changes
on 2024-09-03
levskaya
approved these changes
on 2024-09-04
levskaya
approved these changes
on 2024-09-04
Clean imports
4f5a722e
Add type: ignore
b4dc0949
Remove unnecessary warnings.
e9d19aca
Assignees
No one assigned
Login to write a write a comment.
Login via GitHub