use correct total length to fix static kv_cache performance #23615
use correct total length to fix static kv_cache performance
0a0a5ca7
guschmue
marked this pull request as ready for review 1 year ago
guschmue
merged
37750574
into main 1 year ago
guschmue
deleted the gs/gqa-static-kv-cache branch 1 year ago
Assignees
No one assigned
Login to write a write a comment.
Login via GitHub