llama.cpp
4b6fb652
- context : round n_tokens to next multiple of n_seqs when reserving
Commit
242 days ago
context : round n_tokens to next multiple of n_seqs when reserving

This fixes RWKV inference which fails when ubatch.n_seq_tokens is 0.
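The rounding named in the commit title can be sketched as below. This is a minimal illustration, not the code from the commit: the helper names `round_up_to_multiple` and `tokens_per_seq` are hypothetical, and the sketch assumes that the per-sequence token count is obtained by integer division of `n_tokens` by `n_seqs`, which would yield 0 whenever `n_tokens < n_seqs`.

```cpp
#include <cstdint>

// Hypothetical helper (not from the llama.cpp source): round n_tokens up to
// the next multiple of n_seqs. Assumes n_seqs > 0.
constexpr uint32_t round_up_to_multiple(uint32_t n_tokens, uint32_t n_seqs) {
    return ((n_tokens + n_seqs - 1) / n_seqs) * n_seqs;
}

// Hypothetical helper: tokens per sequence, assuming plain integer division.
constexpr uint32_t tokens_per_seq(uint32_t n_tokens, uint32_t n_seqs) {
    return n_tokens / n_seqs;
}

// Without rounding, 1 token spread over 4 sequences truncates to 0 per sequence.
static_assert(tokens_per_seq(1, 4) == 0, "integer division truncates to zero");

// After rounding, every sequence gets at least one token.
static_assert(tokens_per_seq(round_up_to_multiple(1, 4), 4) == 1, "rounded up");

// Values that are already a multiple of n_seqs are left unchanged.
static_assert(round_up_to_multiple(8, 4) == 8, "exact multiples are kept");
```

Under these assumptions, rounding before the division guarantees a non-zero per-sequence token count, which is the condition the commit message says RWKV inference needs.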
References
compilade/fix-batch-reserve-rwkv
#14140 - context : round n_tokens to next multiple of n_seqs when reserving
Author
compilade
Committer
compilade
Parents
532802f9