DeepSpeed
adding 8bit dequantization kernel for asym fine-grained block quantization in zero-inference
#4450
Merged

adding 8bit dequantization kernel for asym fine-grained block quantization in zero-inference #4450

tjruwase merged 15 commits into master from styoun/zero-inf-8bit-q
stephen-youn
styoun kernels added for asym fine-grained block quantization with 8bits
a9328b9f
styoun formatting
347dea02
stephen-youn stephen-youn requested a review from RezaYazdaniAminabadi RezaYazdaniAminabadi 2 years ago
stephen-youn stephen-youn requested a review from jeffra jeffra 2 years ago
stephen-youn stephen-youn requested a review from mrwyattii mrwyattii 2 years ago
stephen-youn stephen-youn requested a review from awan-10 awan-10 2 years ago
stephen-youn stephen-youn requested a review from cmikeh2 cmikeh2 2 years ago
stephen-youn stephen-youn requested a review from arashb arashb 2 years ago
stephen-youn stephen-youn requested a review from tjruwase tjruwase 2 years ago
styoun clean up the code
a3734881
stephen-youn Merge branch 'master' into styoun/zero-inf-8bit-q
561cd16f
tjruwase
tjruwase commented on 2023-10-05
styoun rename quantize_int4.cu to quantize_intX.cu
e0ad1673
styoun Merge branch 'styoun/zero-inf-8bit-q' of https://github.com/microsoft…
90858e7c
styoun rename test_int4_quantization.py to test_intX_quantization.py
2d341405
styoun "rename test_int4_quantization.py to test_intX_quantization.py"
b9896cd3
styoun rename
359c3e58
tjruwase
tjruwase
tjruwase commented on 2023-10-05
tjruwase
tjruwase commented on 2023-10-05
styoun fix after the pr comments
1ea4bf45
stephen-youn stephen-youn closed this 2 years ago
stephen-youn stephen-youn reopened this 2 years ago
styoun increased coverage of QuantLinear test
0b216378
tjruwase Merge branch 'master' into styoun/zero-inf-8bit-q
571ab839
styoun formatting
68c258c3
styoun Merge branch 'styoun/zero-inf-8bit-q' of https://github.com/microsoft…
7c7310dc
tjruwase Merge branch 'master' into styoun/zero-inf-8bit-q
c9704348
tjruwase
tjruwase approved these changes on 2023-10-10
tjruwase tjruwase merged 6c86ff39 into master 2 years ago

Login to write a write a comment.

Login via GitHub

Assignees
No one assigned
Labels
Milestone