onnxruntime
4771256b - fix to avoid quantizing attention with varied q,k,v sizes (#9357)

Commit
4 years ago
fix to avoid quantizing attention with varied q,k,v sizes (#9357)

* fix to avoid quantizing attention with varied q,k,v sizes
* updated the changes to address the comments
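The commit message does not show the change itself, so the following is only a minimal sketch of the kind of guard it describes, not the actual patch. It assumes the attention node's attributes are available as a plain dictionary and that differing Q, K and V sizes are declared through a `qkv_hidden_sizes` attribute. The helper name `should_quantize_attention` and the dictionary representation are hypothetical.

```python
from typing import Any, Mapping


def should_quantize_attention(attrs: Mapping[str, Any]) -> bool:
    """Decide whether an attention node is safe to quantize.

    Hypothetical helper: `attrs` is assumed to map attribute names to
    values, and `qkv_hidden_sizes` is assumed to list the hidden sizes
    of the Q, K and V projections when they are declared explicitly.
    """
    sizes = attrs.get("qkv_hidden_sizes")
    if not sizes:
        # No explicit per-projection sizes: treat Q, K and V as uniform.
        return True
    # Skip quantization when the declared sizes are not all identical.
    return len(set(sizes)) == 1


if __name__ == "__main__":
    print(should_quantize_attention({"num_heads": 12}))
    print(should_quantize_attention({"qkv_hidden_sizes": [768, 768, 1024]}))
```

A quantizer using a guard like this would leave the node in its original floating-point form whenever the check returns `False`, rather than emitting a quantized attention node it cannot represent correctly.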