llama.cpp
665018c7
- CLBlast: Add broadcast support for matrix multiplication (#3402)
1 year ago
CLBlast: Add broadcast support for matrix multiplication (#3402)

Broadcast src0 into src1 across dimensions 2 and 3 when needed.
This is required for models that use GQA.
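
To illustrate what broadcasting across dimensions 2 and 3 means, here is a minimal CPU sketch in C. It is not the CLBlast code from this commit. The helper names `broadcast_index` and `mul_mat_broadcast`, and the plain row-major layout, are assumptions made for illustration; the `ne02`/`ne03`/`ne12`/`ne13` names follow the usual ggml convention for the extents of src0 and src1 in dimensions 2 and 3. With GQA, several query heads share one key/value head, so src0 can have fewer batches than src1, and each src0 batch is reused for a group of src1 batches.

```c
#include <assert.h>
#include <stdint.h>

/* Hypothetical helper (not part of ggml): map a batch index of src1 to the
 * batch index of src0 it is multiplied with. ne0 is the extent of src0 and
 * ne1 the extent of src1 along the same dimension. */
static int64_t broadcast_index(int64_t i1, int64_t ne0, int64_t ne1) {
    assert(ne0 > 0 && ne1 % ne0 == 0); /* src1 extent must be a multiple of src0's */
    const int64_t r = ne1 / ne0;       /* broadcast ratio: src1 batches per src0 batch */
    return i1 / r;
}

/* Naive batched matrix multiplication with broadcasting over dims 2 and 3.
 * Assumed row-major layouts (illustrative, not ggml's memory layout):
 *   src0: [ne03][ne02][M][K]
 *   src1: [ne13][ne12][K][N]
 *   dst:  [ne13][ne12][M][N] */
static void mul_mat_broadcast(const float *src0, const float *src1, float *dst,
                              int64_t M, int64_t K, int64_t N,
                              int64_t ne02, int64_t ne03,
                              int64_t ne12, int64_t ne13) {
    for (int64_t i13 = 0; i13 < ne13; i13++) {
        const int64_t i03 = broadcast_index(i13, ne03, ne13);
        for (int64_t i12 = 0; i12 < ne12; i12++) {
            const int64_t i02 = broadcast_index(i12, ne02, ne12);

            /* Select the (possibly shared) src0 batch and the src1/dst batch. */
            const float *a = src0 + (i03 * ne02 + i02) * M * K;
            const float *b = src1 + (i13 * ne12 + i12) * K * N;
            float       *c = dst  + (i13 * ne12 + i12) * M * N;

            for (int64_t m = 0; m < M; m++) {
                for (int64_t n = 0; n < N; n++) {
                    float sum = 0.0f;
                    for (int64_t k = 0; k < K; k++) {
                        sum += a[m * K + k] * b[k * N + n];
                    }
                    c[m * N + n] = sum;
                }
            }
        }
    }
}
```

For example, with `ne02 = 2` and `ne12 = 8`, src1 batches 0 to 3 use src0 batch 0 and batches 4 to 7 use src0 batch 1. When the extents match, the ratio is 1 and the mapping reduces to the ordinary non-broadcast case.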
References
#3402 - CLBlast: Support broadcasting for matrix multiplication and GQA
Author
shibe2
Parents
29a404a9