llama.cpp
b264eddb - llama : fix Mamba pooled embeddings with multiple sequences

Commit
1 year ago
llama : fix Mamba pooled embeddings with multiple sequences Until the pooled embeddings are refactored to allow splitting across ubatches for causal embeddings, recurrent models can only process a single sequence per ubatch when calculating pooled embeddings.
Author
Parents
Loading