onnxruntime
Support seq_len > 64K in rotary embedding cuda kernel
#20204
Merged

Support seq_len > 64K in rotary embedding cuda kernel #20204

gh-yewang merged 2 commits into main from wangyems-patch-4
gh-yewang
gh-yewang handle seq_len > 64k
d83f619e
gh-yewang gh-yewang changed the title handle seq_len > 64k Support seq_len > 64K in rotary embedding cuda kernel 2 years ago
gh-yewang gh-yewang requested a review from yufenglee yufenglee 2 years ago
yufenglee
yufenglee commented on 2024-04-05
gh-yewang Update rotary_embedding_impl.cu
09a8bdfe
tianleiwu
tianleiwu approved these changes on 2024-04-06
gh-yewang gh-yewang merged cc3faba6 into main 2 years ago
gh-yewang gh-yewang deleted the wangyems-patch-4 branch 2 years ago

Login to write a write a comment.

Login via GitHub

Reviewers
Assignees
No one assigned
Labels
Milestone