llama.cpp
5c86c9ed
- CUDA: fix crash on large batch size for MoE models (#13384)
Go
Login via GitHub
Home
Pricing
FAQ
Install
Login
via GitHub
Commit
View On
GitHub
Commit
222 days ago
CUDA: fix crash on large batch size for MoE models (#13384)
References
#13384 - CUDA: fix crash on large batch size for MoE models
Author
JohannesGaessler
Parents
efb8b47e
Loading