onnxruntime
[CUDA] Benchmark GQA on popular LLM models
#20646
Merged

tianleiwu merged 1 commit into main from tlwu/benchmark_gqa_on_llm
tianleiwu benchmark GQA on popular models
ea76f488
tianleiwu requested a review from kunal-vaishnavi 1 year ago
tianleiwu requested a review from yufenglee 1 year ago
tianleiwu requested a review from wangyems 1 year ago
kunal-vaishnavi approved these changes on 2024-05-10
tianleiwu merged 85facd67 into main 1 year ago
tianleiwu deleted the tlwu/benchmark_gqa_on_llm branch 1 year ago