ggml : full ALiBi support (#7192)

Commit

1 year ago

ggml : full ALiBi support (#7192) * ggml : full ALiBi support * ggml : update ggml_soft_max_ext() CUDA, SYCL * ggml : ggml_flash_attn_ext() support ALiBi (CPU) * ggml : ggml_flash_attn_ext() support ALiBi (Metal) * ggml : fix warning * ggml : ggml_flash_attn_ext() support ALiBi (CUDA) ggml-ci * ggml : fix assert message * vulkan : add dev notes * ggml : require mask when using ALiBi ggml-ci * convert : fix convert for refact models

References

#7192 - ggml : full ALiBi support

Author

ggerganov

Parents

e8496488

llama.cpp 9cb317f7 - ggml : full ALiBi support (#7192)

llama.cpp
9cb317f7 - ggml : full ALiBi support (#7192)