onnxruntime
use correct total length to fix static kv_cache performance
#23615
Merged

use correct total length to fix static kv_cache performance #23615

guschmue merged 1 commit into main from gs/gqa-static-kv-cache
guschmue
guschmue use correct total length to fix static kv_cache performance
0a0a5ca7
guschmue guschmue added ep:WebGPU
guschmue guschmue marked this pull request as ready for review 1 year ago
satyajandhyala
satyajandhyala approved these changes on 2025-02-11
guschmue guschmue merged 37750574 into main 1 year ago
guschmue guschmue deleted the gs/gqa-static-kv-cache branch 1 year ago

Login to write a write a comment.

Login via GitHub

Reviewers
Assignees
No one assigned
Labels
Milestone