llama.cpp
llama : quantize attention results
#1103
Open

llama : quantize attention results #1103

ggerganov wants to merge 1 commit into master from quant-attn
ggerganov
ggerganov ggerganov added performance
ggerganov ggerganov added generation quality
ggerganov llama : quantize attention results
4b8d5e38
ggerganov ggerganov force pushed to 4b8d5e38 3 years ago
ggerganov ggerganov assigned ggerganov ggerganov 3 years ago
ggerganov ggerganov removed performance
ggerganov ggerganov removed generation quality
ggerganov ggerganov added demo
ggerganov ggerganov unassigned ggerganov ggerganov 183 days ago

Login to write a write a comment.

Login via GitHub

Reviewers
No reviews
Assignees
No one assigned
Labels
Milestone