llama.cpp
91c73601 - llama : add gguf_remove_key + remove split meta during quantize (#6591)

Commit
1 year ago
llama : add gguf_remove_key + remove split meta during quantize (#6591) * Remove split metadata when quantize model shards * Find metadata key by enum * Correct loop range for gguf_remove_key and code format * Free kv memory --------- Co-authored-by: z5269887 <z5269887@unsw.edu.au>
Author
Parents
Loading