llama.cpp
ggml : add IQ2 to test-backend-ops + refactoring
#4990
Merged

ggml : add IQ2 to test-backend-ops + refactoring #4990

ggerganov merged 6 commits into master from gg/iq2-refactor-and-tests
ggerganov
ggerganov ggml : add IQ2 to test-backend-ops + refactoring
bc0bb300
ggerganov cuda : update supports_op for IQ2
e9a5d54b
ggerganov ci : enable LLAMA_CUBLAS=1 for CUDA nodes
36feaeb4
ggerganov cuda : fix out-of-bounds-access in `mul_mat_vec_q`
b7ddc8bf
cebtenzzre
cebtenzzre commented on 2024-01-16
ggerganov tests : avoid creating RNGs for each Q tensor
8eb8fd94
ggerganov tests : avoid creating RNGs for each tensor
49bafe09
ggerganov ggerganov added sync
ggerganov ggerganov merged 38566680 into master 1 year ago

Login to write a write a comment.

Login via GitHub

Reviewers
Assignees
No one assigned
Labels
Milestone