xla
Support int4 weight in quantized matmul/linear
#7235
Merged

lsy323 merged 24 commits into master from lsiyuan/int4-quant-ops
add quantized layers per channel
f7c200af
enhance tests, clean up
f48c666a
add q ops to ci
65f6fcab
add README
c042e2fa
update readme
878d7e78
update readme
b4542372
initial commit for int4
e69627fe
add some tests
810f1049
use literal
b8ed810c
fix bad malloc
27acbbb7
add a subchannel test
7c52bf92
add tests
9fd7caa4
add TPU numerical check
fa29ba27
refactor
9c47f637
format
256a2616
merge
059053be
lsy323 marked this pull request as ready for review 2 years ago
update docl
5c4c7f0e
rename to cast_int4
03f46f14
lsy323 force pushed from 43301176 to 03f46f14 2 years ago
remove dup files
11be78b3
format
3a1d83f7
JackCaoG requested a review from JackCaoG 2 years ago
remove comment
62a0b17a
remove comment
5fe2f09e
JackCaoG commented on 2024-06-10
JackCaoG commented on 2024-06-10
JackCaoG commented on 2024-06-10
JackCaoG commented on 2024-06-10
JackCaoG commented on 2024-06-10
JackCaoG commented on 2024-06-10
JackCaoG commented on 2024-06-10
JackCaoG commented on 2024-06-10
JackCaoG commented on 2024-06-10
JackCaoG commented on 2024-06-10
remove unused pack unpack and test
9addde9c
lsy323 requested a review from JackCaoG 2 years ago
JackCaoG approved these changes on 2024-06-10
lsy323 added the quantization label
fix import
77c61a6c
lsy323 merged ac371fb8 into master 2 years ago
miladm assigned miladm 2 years ago
miladm assigned lsy323 2 years ago
miladm unassigned miladm 2 years ago
lsy323 deleted the lsiyuan/int4-quant-ops branch 1 year ago
