llama.cpp
Implementations for Q4_0_8_8 quantization based functions - AVX512 version of ggml_gemm_q4_0_8x8_q8_0
#9532

Merged

Implementations for Q4_0_8_8 quantization based functions - AVX512 version of ggml_gemm_q4_0_8x8_q8_0 #9532

ggerganov merged 6 commits into ggml-org:master from Srihari-mcw:block_interleaving_q4_0_8_8_gemm_512

AVX512 version of ggml_gemm_q4_0_8x8_q8_0

a829583c

github-actions added ggml

Remove zero vector parameter passing

7aee79bd

Rename functions and rearrange order of macros

7436d529

Edit commments

14d2abb8

ggerganov approved these changes on 2024-09-23

style : minor adjustments

407910ff

ggerganov commented on 2024-09-23

Update x to start from 0

448e4a94

ggerganov merged 1e7b9299 into master 1 year ago

Reviewers

ggerganov

Assignees

No one assigned

Labels

ggml

Milestone

No milestone