whisper.cpp
whisper : add batched decoding
#1486
Merged

ggerganov merged 14 commits into master from batched
ggerganov whisper : add whisper_batch
3cbaaed0
ggerganov whisper : move kv_self to whisper_state
8b943f98
ggerganov whisper : full batched decoding support
91096daa
ggerganov whisper : fix memory leak in whisper_batch
3d24e35f
ggerganov whisper : fix mem leak again + remove obsolete function
b2123cb4
ggerganov marked this pull request as ready for review 2 years ago
ggerganov whisper : clear kv cache when using whisper_decode API
d7760357
ggerganov whisper : speed-up sampling
9006946e
ggerganov whisper : fix decoders initializer
3ed9af34
ggerganov bench : add batch size 5 bench
ae1bd690
ggerganov whisper : add comment about the KV cache size
6c8a003a
ggerganov whisper : add check for max number of decoders
820f4589
ggerganov whisper : avoid starting sampling threads with bs=1
4c245ea1
ggerganov whisper : enable beam-search by default
b7c82a37
ggerganov cuda : sync llama.cpp fixes
270b1e48
ggerganov merged b6c5f49b into master 2 years ago
