Fix attention perf regression #8682
undo change in attention cpu
730a0a9c
fix perf regression
1eb0370e
fix build error
d300c2f2
remove undir_mask buffer
c2434b29
disable persistent softmax by default
ba98aba9
update comments
e4166581
gh-yewang
approved these changes
on 2021-08-11
tianleiwu
merged
f661c186
into master 4 years ago
tianleiwu
deleted the tlwu/fix_attention_cpu_perf_regression branch 4 years ago
Assignees
No one assigned
Login to write a write a comment.
Login via GitHub