Address ZeroK case for Gemm for CPU and CUDA #22111
Address ZeroK case for Gemm for CPU and CUDA
c50c04d0
snnn
commented
on 2024-09-17
snnn
commented
on 2024-09-17
Add QGemm K == 0 handling
4713219f
yuslepukhin
dismissed their stale review
via 4713219f
1 year ago
Make GCC happy with .template<>()
aba1d52e
Address data conversion issues
5c336c77
Implemenet clipping properly
837fcf73
snnn
dismissed these changes
on 2024-09-19
Rework BiasBroadcast
0353d5f6
yuslepukhin
dismissed their stale review
via 0353d5f6
1 year ago
Fix zp fill out
c68f2da5
Rework zp fill out
96017936
Make sure output is float with y_zp not present
9ca55b41
Account for b_scale being a vector of N
09f42ecc
Remove QGemm changes
af0d25e2
snnn
dismissed these changes
on 2024-09-19
Address review comments
2676a6c7
yuslepukhin
dismissed their stale review
via 2676a6c7
1 year ago
edgchen1
approved these changes
on 2024-09-20
yuslepukhin
deleted the yuslepukhin/zeroK_Gemm branch 1 year ago
Assignees
No one assigned
Login to write a write a comment.
Login via GitHub