llama.cpp
quantize: be able to explicitly specify quantization type of output and token embedding tensors
#6239
Merged


ggerganov merged 2 commits into master from ik/quantize_not_repeating
ikawrakow
quantize: be able to specify the output tensor type
7883796f
quantize: be able to specify the token embedding tensor type
0e826d12
slaren approved these changes on 2024-03-22
ggerganov approved these changes on 2024-03-22
ggerganov merged 1d0331c1 into master 1 year ago
