llama.cpp
Implementations for Q4_0_8_8 quantization based functions - AVX512 version of ggml_gemm_q4_0_8x8_q8_0
#9532
Merged

Srihari-mcw AVX512 version of ggml_gemm_q4_0_8x8_q8_0
a829583c
github-actions added the ggml label
Srihari-mcw Remove zero vector parameter passing
7aee79bd
Srihari-mcw Rename functions and rearrange order of macros
7436d529
Srihari-mcw Edit commments
14d2abb8
ggerganov approved these changes on 2024-09-23
ggerganov style : minor adjustments
407910ff
ggerganov commented on 2024-09-23
Srihari-mcw Update x to start from 0
448e4a94
ggerganov merged 1e7b9299 into master 1 year ago