onnxruntime
Update GQA benchmark to support bfloat16
#26898
Merged

Update GQA benchmark to support bfloat16 #26898

tianleiwu merged 1 commit into main from tlwu/update_gqa_benchmark
tianleiwu
tianleiwu [CUDA] Update GQA benchmark
667969cf
tianleiwu tianleiwu requested a review from nenad1002 nenad1002 70 days ago
tianleiwu tianleiwu requested a review from apsonawane apsonawane 70 days ago
tianleiwu tianleiwu requested a review from kunal-vaishnavi kunal-vaishnavi 70 days ago
kunal-vaishnavi
kunal-vaishnavi approved these changes on 2026-01-05
tianleiwu tianleiwu merged 5307dc59 into main 67 days ago
tianleiwu tianleiwu deleted the tlwu/update_gqa_benchmark branch 67 days ago

Login to write a write a comment.

Login via GitHub

Assignees
No one assigned
Labels
Milestone