whisper.cpp
f8c75dc4
- CUDA: fix crash on large batch size for MoE models (llama/13384)
Go
Login via GitHub
Home
Pricing
FAQ
Install
Login
via GitHub
Commit
View On
GitHub
Commit
232 days ago
CUDA: fix crash on large batch size for MoE models (llama/13384)
References
#3148 - sync : ggml
Author
JohannesGaessler
Committer
ggerganov
Parents
00c80567
Loading