[CUDA] enable causal in MultiHeadAttention #21852
causal MHA
8ae33e06
tianleiwu marked this pull request as draft 1 year ago
update mha test
d6f3378c
tianleiwu marked this pull request as ready for review 1 year ago
wangyems approved these changes on 2024-08-26
tianleiwu merged ad382120 into main 1 year ago
tianleiwu deleted the tlwu/mha_causal branch 1 year ago