llama.cpp
ggml : add IQ2 to test-backend-ops + refactoring
#4990

Merged

ggerganov merged 6 commits into master from gg/iq2-refactor-and-tests

ggml : add IQ2 to test-backend-ops + refactoring

bc0bb300

cuda : update supports_op for IQ2

e9a5d54b

ci : enable LLAMA_CUBLAS=1 for CUDA nodes

36feaeb4

cuda : fix out-of-bounds-access in `mul_mat_vec_q`

b7ddc8bf

cebtenzzre commented on 2024-01-16

tests : avoid creating RNGs for each Q tensor

8eb8fd94

tests : avoid creating RNGs for each tensor

49bafe09

ggerganov added the sync label

ggerganov merged 38566680 into master 2 years ago

Reviewers

cebtenzzre

Labels

sync
