llama.cpp
d8c929ff - feat: Allow custom layer filters for hybrid recurrent

Commit · 322 days ago
feat: Allow custom layer filters for hybrid recurrent

This should help support architectures like Falcon H1, where there is overlap between the layers that need attention caches and those that need recurrent caches.

https://github.com/ggml-org/llama.cpp/pull/13979#discussion_r2140748922

Branch: HybridRecurrentCache
Signed-off-by: Gabe Goodhart <ghart@us.ibm.com>