Block interleaving support for Q4_K quantization for x86 AVX2 architecture #12332
Add block interleaving support for Q4_K quantization
fae86a56
Remove whitespaces and fix CI/CD issues
33bab804
Update pointer of bsums from int16_t to const int16_t
022ad356
Add vector version of quantize_q8_K_4x8 function
cba0df39
ggerganov
approved these changes
on 2025-03-19
Update code formatting based on review comments
adb86d7d
ggerganov
merged
3d82dbcb
into master 272 days ago
Assignees
No one assigned
Login to write a write a comment.
Login via GitHub