add template to support more dtypes
6be14123
update cmake list
252ac0f8
fix typo
f98c9e5d
fix compile cpu
902bf359
make different dtype works
fef8459f
use bf16 on CPU
55cbaa0d
fix state2 dtype
bbef95b3
remove torch
e8425135
rm torch
d4473fa9
enable float to bf16
dea8dd63
rm dequantizeBlockwise4bitCpu
e9bb4fe1
fix check
cdc8d5e0
enable dequant 4bit kernel
baacfac2
fix typo
eec35212
fix typo
d7cc1c5e
fix dequantize
124b754e
fix
0f918c72
fix
e1a8b20d
test
eab45c85
fix
d9f5dd8e
fix
070f8a08
fix
a84addfe
fix
c4bb6607
fix
4ba13fd3
change input param
c0d05ec1
fix typo
62a16a6e
fix input param
d9ad8282
split 8bit and 4bit
09ed6cbf
fix typo
a3f7b611
fix typo
47084701
fix input params
1dfe9f71
fix input params
00289c42
fix
a2578baa
fix typo
72033dc1
enable dequant4bit
1c20ae83
fix
7552fe22
fix
8b32a39c
fix reverse
8f1cc369
fix dequant 4bit fallback path
49d242a8
fix fp4 dequant
4a9a6dc1
Merge branch 'main' into cpu_kernel
6bcd19e3
rm _Float16
d7e981d9
tmp codes
48739b09
enable gemv
f784be86
change to 4bit dequant
92192c9f
fix def
bd02e712
fix type
85200691
fix absmax dtype
e921cbb5
fix type
9b5d97a3
fix compile and type
fd6cff13
enable gemv
46d6e47a
fix shape
3271c308
fix lib name
176a2b61
debug
196984a7
update
76521152
enable gemv 4bit bf16
ea0e6497
enable avx512 check
9277d24d
fix check
4fb315bc
fix endif
81f19844
fix format
0f78bada
fix format
fcb84565
fix def
c5e18945
jiqing-feng marked this pull request as ready for review 33 days ago
rebase
f2029c6e
fix position
df1d669a
fix format
bb3ac8da
rm duplicated func
26b56852
Merge branch 'main' into cpu_fused_kernel
445725b3
rm useless code comments
580010cc
fix out shape
57b89bfa
Merge branch 'main' into cpu_fused_kernel
302a5fe3
fix comments
de5fb9c9
add reverse format
6858a90b
check avx512bf16
3b3d609b
fix has_avx512bf16
fbb911b6
fix tests
3179b42b
fix absmax shape
0c88d436
fix compile
feb8ad22
fix tests
c6b714d8
fix test_gemv
54971118
Merge branch 'main' into cpu_fused_kernel
0045c4b0
jiqing-feng force pushed from d2de0f5c to 0045c4b0 22 days ago
disable binsearch
bdb25c04
fix lint
6cec12dc
fix save
692a8e15