llama.cpp
4696d567
- CUDA: fix crash on large batch size for quant. MoE (#13537)
Commit
216 days ago
CUDA: fix crash on large batch size for quant. MoE (#13537)
References
#13537 - CUDA: fix crash on large batch size for quant. MoE
Author
JohannesGaessler
Parents
b7d26720