correction of the attn.v.weight quantization for IQ3_XS #6209
correction of the attn.v.weight quantization for IQ3_XS
54e252ea
ikawrakow
approved these changes
on 2024-03-22
ggerganov
merged
e80f06d2
into master 1 year ago
Nexesenex
deleted the patch-1 branch 1 year ago
Assignees
No one assigned
Login to write a write a comment.
Login via GitHub