llama.cpp
ggml : Implementations for Q4_0_8_8 quantization based functions - RISC-V vector version
#10029
Merged

ggerganov merged 7 commits into ggml-org:master from xctan:rvv_q4_0_8x8
xctan ggml : RISC-V vector gemv for q4_0_8x8
9bfecf42
xctan ggml : Added WIP rvv q4_0_8x8 gemm
3f7fdf24
xctan ggml : Added initial implementation of rvv gemm
238cd667
xctan ggml : optimize gemm to avoid register spillover
c039415e
xctan ggml : Fix GCC rvv load alignment issue
78c78e2a
github-actions added ggml
ggerganov approved these changes on 2024-10-25
xctan ggml : Format gemm rvv code
37057a0f
xctan ggml : Fix a typo in RVV q4_0_8_8 GEMM
274a7720
ggerganov merged fc83a9e5 into master 345 days ago