llama.cpp
665018c7 - CLBlast: Add broadcast support for matrix multiplication (#3402)

Commit
1 year ago
Broadcast src0 into src1 across dimensions 2 and 3 when needed. This is required for models that use grouped-query attention (GQA).
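
The broadcast described in the commit message can be sketched in plain C. This is a minimal CPU illustration, not the CLBlast code from the commit. The function name `matmul_broadcast`, the contiguous row-major layout, and the integer-division index mapping (`i02 = i12 / r2`, `i03 = i13 / r3`) are assumptions made for the example. The idea is that when src1 has more batches than src0 in dimension 2 or 3 (for instance, more query heads than key/value heads under GQA), each src0 matrix is reused for a group of consecutive src1 matrices.

```c
#include <assert.h>
#include <stdint.h>

/*
 * Hypothetical sketch of a batched matrix multiplication with broadcast.
 *
 * Assumed layout (contiguous, row-major):
 *   src0: ne03 x ne02 matrices, each M rows of K floats
 *   src1: ne13 x ne12 matrices, each N rows of K floats
 *   dst : ne13 x ne12 matrices, each N rows of M floats
 *
 * Each dst element is the dot product of one src0 row and one src1 row.
 * ne12 must be a multiple of ne02, and ne13 a multiple of ne03.
 */
static void matmul_broadcast(const float *src0, const float *src1, float *dst,
                             int64_t K, int64_t M, int64_t N,
                             int64_t ne02, int64_t ne03,
                             int64_t ne12, int64_t ne13) {
    assert(ne02 > 0 && ne03 > 0);
    assert(ne12 % ne02 == 0 && ne13 % ne03 == 0);

    /* Broadcast ratios: how many src1 batches share one src0 batch. */
    const int64_t r2 = ne12 / ne02;
    const int64_t r3 = ne13 / ne03;

    for (int64_t i13 = 0; i13 < ne13; i13++) {
        for (int64_t i12 = 0; i12 < ne12; i12++) {
            /* Map the src1 batch index to the src0 batch it reads from. */
            const int64_t i03 = i13 / r3;
            const int64_t i02 = i12 / r2;

            const float *a = src0 + (i03 * ne02 + i02) * M * K;
            const float *b = src1 + (i13 * ne12 + i12) * N * K;
            float       *d = dst  + (i13 * ne12 + i12) * N * M;

            for (int64_t n = 0; n < N; n++) {
                for (int64_t m = 0; m < M; m++) {
                    float sum = 0.0f;
                    for (int64_t k = 0; k < K; k++) {
                        sum += a[m * K + k] * b[n * K + k];
                    }
                    d[n * M + m] = sum;
                }
            }
        }
    }
}
```

When src0 and src1 have the same batch sizes, both ratios are 1 and the loop reduces to an ordinary batched multiplication, so the broadcast path only changes behavior when the shapes differ.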