llama.cpp
llama : fix shapes for bert/mpt q/k norm
#16409
Merged

Loading