onnxruntime
197da135 - Implement quantized Attention on cpu (#4111)

Commit
5 years ago
Implement quantized Attention on cpu (#4111) * Implement QAttention on CPU * support QAttention in quantization tool * refine attention code * add more unit tests
Author
Parents
Loading