whisper.cpp
whisper : add full CUDA and Metal offloading
#1472
Merged

whisper : add full CUDA and Metal offloading #1472

ggerganov merged 21 commits into master from ggml-backend-no-sched
ggerganov
ggerganov whisper : migrate to ggml-backend
65975732
ggerganov whisper : fix logit reading
7e01486b
ggerganov whisper : fix tensor allocation during load
3dfbe649
ggerganov whisper : fix beam-search with CUDA
dcf9511d
ggerganov whisper : free backends + fix compile warning
12030358
ggerganov whisper : print when CUDA is enabled
3f5c1b7e
ggerganov Merge branch 'master' into ggml-backend-no-sched
0ab50253
ggerganov whisper : fix CoreML
a54d8c9d
ggerganov make : clean-up
d6dad64f
ggerganov Merge branch 'master' into ggml-backend-no-sched
728e1785
ggerganov talk : fix compile warning
c99e290a
ggerganov whisper : support ggml_conv with CUDA and Metal (#1473)
933c5bef
ggerganov whisper : clean-up
f53e1388
ggerganov
ggerganov ggerganov changed the title whisper : add full CUDA offloading whisper : add full CUDA and Metal offloading 2 years ago
slaren
ggerganov
bobqianic
slaren
ggerganov
ggerganov quantize-all : fix
3bfc43e3
slaren
dreness
ggerganov ggml : im2col opts
66bb2e94
ggerganov whisper : avoid whisper_model_data wrapper
0867e696
ggerganov whisper : add note that ggml_mul_mat_pad does not work with CUDA
b27726da
ggerganov
ggerganov commented on 2023-11-11
dreness
ggerganov
slaren
ggerganov
ggerganov whisper : factor out graph compute in common function
b6182293
slaren
slaren
ggerganov whisper : fixes
fc8565d0
ggerganov
slaren
slaren
ggerganov
ggerganov whisper : fix UB with measure buffers
40c66036
bobqianic
ggerganov whisper : try to fix the parallel whisper_state functionality (#1479)
5031f547
ggerganov ggerganov merged b0502836 into master 2 years ago
100tomer

Login to write a write a comment.

Login via GitHub

Reviewers
No reviews
Assignees
No one assigned
Labels
Milestone