llama.cpp
865af990 - ggml : ggml_flash_attn_ext() support ALiBi (CUDA)
Commit
1 year ago
ggml : ggml_flash_attn_ext() support ALiBi (CUDA)
ggml-ci
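For context, ALiBi (Attention with Linear Biases, Press et al.) skips positional embeddings and instead adds a per-head linear penalty to the raw attention scores, proportional to the distance between the query and key positions. This commit extends the fused flash-attention path on CUDA to apply that bias. Below is a minimal C sketch of the standard ALiBi formulation; the function names are illustrative only and do not reflect ggml's internal API or kernel code.

// Illustrative sketch of ALiBi, not the actual ggml CUDA kernel.
#include <math.h>
#include <stdio.h>

// Per-head ALiBi slope for n_head heads (a power of two), as in
// Press et al.: m_h = 2^(-8*h/n_head) for h = 1..n_head.
static float alibi_slope(int head, int n_head) {
    return powf(2.0f, -8.0f * (float)(head + 1) / (float)n_head);
}

// Bias added to the raw attention score between query position i and
// key position j (j <= i for causal attention): m * (j - i).
// It is 0 on the diagonal and grows more negative with distance.
static float alibi_bias(float slope, int i, int j) {
    return slope * (float)(j - i);
}

int main(void) {
    const int n_head = 8;
    for (int h = 0; h < n_head; ++h) {
        const float m = alibi_slope(h, n_head);
        // Biased score: score'(i, j) = q_i . k_j * scale + m * (j - i)
        printf("head %d: slope %.6f, bias at distance 4: %.6f\n",
               h, m, alibi_bias(m, 10, 6));
    }
    return 0;
}

Because the bias is a simple affine function of (j - i), a fused flash-attention kernel can compute it on the fly per tile instead of materializing an explicit mask tensor, which is what makes folding ALiBi into ggml_flash_attn_ext() attractive.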
References
#7192 - ggml : full ALiBi support
Author
ggerganov
Parents
f7055d31