onnxruntime
4e670f7a - Support larger hidden size in Attention Cuda kernel (#7002)

Commit
4 years ago
Support larger hidden size in Attention Cuda kernel (#7002) * Support larger hidden size in Attention Cuda kernel * Update attention_transpose.cu * review comments * fix typo and add check in quantization * update readme
Author
Parents
Loading