llama.cpp
b0311c16
- CUDA: fix padding of GQA to power of 2 in FA (#19115)
Go
Login via GitHub
Home
Pricing
FAQ
Install
Login
via GitHub
Commit
View On
GitHub
Commit
3 days ago
CUDA: fix padding of GQA to power of 2 in FA (#19115)
References
#19115 - CUDA: fix padding of GQA to power of 2 in FA
Author
JohannesGaessler
Parents
8f80d1b2
Loading