onnxruntime
[CPU] GQA supports head_sink input for smooth softmax
#25269
Merged

Loading