metal : gemma2 flash attention support #9159
metal : gemma2 flash attention support
054203ae
slaren marked this pull request as draft 1 year ago
use precise::tanh
edc2e273
slaren marked this pull request as ready for review 1 year ago
ggerganov approved these changes on 2024-08-26
slaren merged 0c41e03c into master 1 year ago
slaren deleted the sl/metal-logit-softcap branch 1 year ago