[`FA-2`] Add Flash Attention to `Phi` #27661
add FA and modify doc file
cdcd671c
test_flash_attn_2_generate_padding_right test overwritten
17185d89
comment
a33fa732
modify persimmon modeling file
e16090b1
added speedup graph
30cac2e5
more changes
9e22498e
susnato
force pushed
to
9e22498e
2 years ago
Assignees
No one assigned
Login to write a write a comment.
Login via GitHub