llama.cpp
bb4f7a9e
- memory : fix broken batch splits for recurrent cache (#14575)
71 days ago
memory : fix broken batch splits for recurrent cache (#14575)

Splits producing more than one ubatch per batch for recurrent models were broken with #14512. This fixes it by moving the completeness check after the ubatch split loop.
References
#14575 - memory : fix broken batch splits for recurrent cache
Author
compilade
Parents
b8eeb874