llama.cpp
2ff3354c - memory : fix broken batch splits for recurrent cache

Commit
215 days ago
memory : fix broken batch splits for recurrent cache

Splits producing more than one ubatch per batch for recurrent models were broken with #14512. This fixes it by moving the completeness check after the ubatch split loop.
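
The ordering issue described in the commit message can be illustrated with a toy model. This is only a sketch under stated assumptions: `ToySplitter`, `split_check_inside`, and `split_check_after` are hypothetical names invented here and are not llama.cpp's API, and the "inside the loop" placement of the check is a guess at the broken ordering rather than the actual pre-fix code. In this toy the splitter always consumes every token, so the final check never fails; a real splitter may be unable to place some tokens, which is what the check is meant to catch.

```cpp
#include <algorithm>
#include <cassert>
#include <optional>
#include <vector>

// Hypothetical stand-in for a batch splitter (not llama.cpp's real types).
struct ToySplitter {
    int n_tokens;    // total tokens in the batch
    int n_used = 0;  // tokens already assigned to some ubatch

    // Returns the size of the next ubatch (at most n_ubatch), or 0 when none are left.
    int next(int n_ubatch) {
        assert(n_ubatch > 0);
        const int n = std::min(n_ubatch, n_tokens - n_used);
        n_used += n;
        return n;
    }
};

// Assumed broken ordering: the completeness check runs inside the loop,
// so a batch that needs more than one ubatch is rejected after the first split.
std::optional<std::vector<int>> split_check_inside(ToySplitter s, int n_ubatch) {
    std::vector<int> ubatches;
    while (true) {
        const int n = s.next(n_ubatch);
        if (n == 0) {
            break;
        }
        ubatches.push_back(n);
        if (s.n_used < s.n_tokens) {
            return std::nullopt;  // premature: more ubatches were still to come
        }
    }
    return ubatches;
}

// Ordering described by the commit: split everything first, then check
// once that every token ended up in some ubatch.
std::optional<std::vector<int>> split_check_after(ToySplitter s, int n_ubatch) {
    std::vector<int> ubatches;
    while (true) {
        const int n = s.next(n_ubatch);
        if (n == 0) {
            break;
        }
        ubatches.push_back(n);
    }
    if (s.n_used < s.n_tokens) {
        return std::nullopt;  // the split genuinely failed to cover the batch
    }
    return ubatches;
}
```

With a 10-token batch and a ubatch size of 4, `split_check_inside` reports failure after the first ubatch, while `split_check_after` yields three ubatches of sizes 4, 4, and 2. A batch that fits in a single ubatch passes under either ordering, which is consistent with the message saying only splits producing more than one ubatch were affected.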