llama.cpp
750f60c0
- CUDA: fix Pascal FA, deq. KV to FP16 for batch > 8 (#7681)
Commit
1 year ago
References
#7681 - CUDA: fix Pascal FA, deq. KV to FP16 for batch > 8
Author
JohannesGaessler
Parents
9b596417