llama.cpp
0fc1e820
- CUDA: faster large batch FA without tensor cores (#7314)
Go
Login via GitHub
Home
Pricing
FAQ
Install
Login
via GitHub
Commit
View On
GitHub
Commit
1 year ago
CUDA: faster large batch FA without tensor cores (#7314)
References
#7314 - CUDA: faster large batch FA without tensor cores
Author
JohannesGaessler
Parents
82ca83db
Loading