Fixes default value of `softmax_scale` in `PhiFlashAttention2`. #28537
fix(phi): Phi does not use softmax_scale in Flash-Attention.
6a5ad209
chore(docs): Update Phi docs.
baf8d3ef
gugarosa
marked this pull request as ready for review 2 years ago
gugarosa
deleted the fix-phi-tune branch 2 years ago
Assignees
No one assigned
Login to write a write a comment.
Login via GitHub