llama.cpp
99ed03a2
- metal : improve decoding speed for batches of 2-16
Go
Login via GitHub
Home
Pricing
FAQ
Install
Login
via GitHub
Commit
View On
GitHub
Commit
1 year ago
metal : improve decoding speed for batches of 2-16
References
#3524 - metal : support MTLGPUFamily < Apple7, formatting, style
Author
ggerganov
Parents
f1782c68
Loading