llama.cpp
ggml : full ALiBi support
#7192
Merged

ggml : full ALiBi support #7192

ggerganov merged 10 commits into master from gg/refactor-alibi-2
ggerganov
ggerganov ggml : full ALiBi support
7fdca334
ggerganov ggml : update ggml_soft_max_ext() CUDA, SYCL
d0592d49
ggerganov ggerganov force pushed from 922a5b3d to d0592d49 1 year ago
ggerganov ggml : ggml_flash_attn_ext() support ALiBi (CPU)
166e60bf
ggerganov ggerganov force pushed from a4c7cf7e to 166e60bf 1 year ago
mofosyne mofosyne added Review Complexity : High
mofosyne mofosyne added enhancement
mofosyne mofosyne added model
ggerganov ggml : ggml_flash_attn_ext() support ALiBi (Metal)
97c27f59
ggerganov ggerganov force pushed from ba4d12ab to 97c27f59 1 year ago
ggerganov
ggerganov commented on 2024-05-10
ggerganov ggml : fix warning
f7055d31
JoanFM
JoanFM commented on 2024-05-10
ggerganov ggml : ggml_flash_attn_ext() support ALiBi (CUDA)
865af990
ggerganov
ggerganov
ggerganov ggerganov marked this pull request as ready for review 1 year ago
ggerganov ggerganov requested a review from slaren slaren 1 year ago
JoanFM
JoanFM
JoanFM requested changes on 2024-05-10
ggerganov ggml : fix assert message
536983b1
JoanFM
JoanFM commented on 2024-05-10
ggerganov vulkan : add dev notes
397b1f8f
NeoZhangJianyu
NeoZhangJianyu approved these changes on 2024-05-10
slaren
slaren approved these changes on 2024-05-10
ggerganov ggml : require mask when using ALiBi
0faf92e7
ggerganov ggerganov force pushed from a6166051 to 0faf92e7 1 year ago
mofosyne mofosyne added refactoring
ggerganov convert : fix convert for refact models
03e940cd
ggerganov ggerganov merged 9cb317f7 into master 1 year ago
JohannesGaessler
github-actions

Login to write a write a comment.

Login via GitHub

Assignees
No one assigned
Labels
Milestone