llama : add attention weights extraction API [EXPERIMENTAL] #20086
llama : add attention weights extraction API [EXPERIMENTAL]
14bf6d45
Use internal cb_eval for attention extraction to eliminate graph splits
b550fa6e
QuentinFuxa
force pushed
from
472702a8
to
5ac48b06
1 day ago
QuentinFuxa
force pushed
from
5ac48b06
to
e8734acb
1 day ago
QuentinFuxa
force pushed
from
e8734acb
to
b550fa6e
1 day ago
Assignees
No one assigned
Login to write a write a comment.
Login via GitHub