onnxruntime
[CUDA] Fix performance bug in DecoderMaskedMultiheadAttention for BeamSearch
#17613
Merged

[CUDA] Fix performance bug in DecoderMaskedMultiheadAttention for BeamSearch #17613

hariharans29 merged 1 commit into main from hari/main_opt
hariharans29
hariharans29 Main Optimized
11f6a18e
hariharans29 hariharans29 changed the title Fix performance bug in DecoderMaskedMultiheadAttention [CUDA] Fix performance bug in DecoderMaskedMultiheadAttention for BeamSearch 2 years ago
hariharans29 hariharans29 requested a review from zhanghuanrong zhanghuanrong 2 years ago
hariharans29 hariharans29 requested a review from wangyems wangyems 2 years ago
hariharans29 hariharans29 requested a review from tianleiwu tianleiwu 2 years ago
hariharans29 hariharans29 requested a review from yufenglee yufenglee 2 years ago
hariharans29
hariharans29 commented on 2023-09-19
hariharans29
hariharans29 commented on 2023-09-19
tianleiwu
tianleiwu approved these changes on 2023-09-19
wangyems
wangyems approved these changes on 2023-09-19
hariharans29 hariharans29 merged c65e8920 into main 2 years ago
hariharans29 hariharans29 deleted the hari/main_opt branch 2 years ago

Login to write a write a comment.

Login via GitHub

Assignees
No one assigned
Labels
Milestone