llama.cpp
fcca0a70 - refact : fix convert script + zero out KV cache to avoid nans (#3523)

Commit
1 year ago
refact : fix convert script + zero out KV cache to avoid nans (#3523)

* refact : fix convert script + zero out KV cache to avoid nans
* ggml : silu(-inf) should never happen
* metal : assert various kernel requirements
  • convert-refact-hf-to-gguf.py
  • examples/parallel
    • parallel.cpp
  • ggml-metal.m
  • ggml-metal.metal
  • ggml.c
  • llama.cpp