llama.cpp
9b82476e - Add missing inference support for GPTNeoXForCausalLM (Pythia and GPT-NeoX base models) (#7461)

Commit
1 year ago
Add missing inference support for GPTNeoXForCausalLM (Pythia and GPT-NeoX base models) (#7461) * convert-hf : add conversion of bloom-style qkv tensor to gpt-style qkv (code borrowed from BloomModel) * llama : add inference support for LLM_ARCH_GPTNEOX * llama : add model types for every Pythia variant and GPT-NeoX Co-authored-by: Stanisław Szymczyk <sszymczy@gmail.com>
Author
Parents
Loading