openvino
[ONNX FE][Transformation] GroupQueryAttention refine work for NPU static shape, only output current KV
#35230
Open

[ONNX FE][Transformation] GroupQueryAttention refine work for NPU static shape, only output current KV #35230

bopeng1234
bopeng1234 GQA static shape, only output current length KV, let APP insert curre…
c4d9d80b
bopeng1234 add comments
41a2a08e
github-actions github-actions added category: Core
github-actions github-actions added category: transformations
bopeng1234 bopeng1234 added do not merge

Login to write a write a comment.

Login via GitHub

Reviewers
No reviews
Assignees
No one assigned
Labels
Milestone