llama.cpp
llama : add attention weights extraction API [EXPERIMENTAL]
#20086
Open

Loading