llama.cpp
2ff3354c - memory : fix broken batch splits for recurrent cache

Commit

1 year ago

memory : fix broken batch splits for recurrent cache Splits producing more than one ubatch per batch for recurrent models were broken with #14512. This fixes it by moving the completeness check after the ubatch split loop.

References

compilade/fix-recurrent-batch-init

#14575 - memory : fix broken batch splits for recurrent cache

Author

compilade

Committer

compilade

Parents

e1a70590

llama.cpp 2ff3354c - memory : fix broken batch splits for recurrent cache

llama.cpp
2ff3354c - memory : fix broken batch splits for recurrent cache