onnxruntime
Optimize quantized LSTM
#8634
Merged

Optimize quantized LSTM #8634

zhanghuanrong merged 11 commits into master from zhalei/qlstm3
zhanghuanrong
zhanghuanrong optimize some lstm gate computation. Remove no need string constructi…
71da820c
zhanghuanrong change gcc optimization flags for computation bound logics in rnn_hel…
30f7e8a3
zhanghuanrong better qgemm for M=1
5dc2f787
zhanghuanrong Some improve on avx512
a0cf1131
zhanghuanrong zhanghuanrong requested a review from yufenglee yufenglee 4 years ago
zhanghuanrong zhanghuanrong requested a review from tracysh tracysh 4 years ago
zhanghuanrong zhanghuanrong requested a review 4 years ago
zhanghuanrong add condition to limit GCC related marcros
aa21fe8f
zhanghuanrong Correct QGemm assembly for M=1 AVX2 optimization to pass mlas_test.
04c3d924
zhanghuanrong Fix rnn_helper build issue for wasm.
ab7ed387
zhanghuanrong better asm code here according to feedbacks.
c8508dec
zhanghuanrong Remove customized vectorize and unroll option for GCC.
c985ad68
zhanghuanrong Better restrict semantic for merge_lstm_gates_to_memory() by adding i…
5fcb50ad
yufenglee yufenglee changed the title Zhalei/qlstm3 Optimize quantized LSTM 4 years ago
yufenglee
yufenglee dismissed these changes on 2021-08-10
zhanghuanrong Force CI restart as it stucked by the onnxruntime-python-checks-ci-pi…
de1b7ba0
zhanghuanrong zhanghuanrong dismissed their stale review via de1b7ba0 4 years ago
yufenglee
yufenglee approved these changes on 2021-08-12
zhanghuanrong zhanghuanrong merged 76dfe810 into master 4 years ago
zhanghuanrong zhanghuanrong deleted the zhalei/qlstm3 branch 4 years ago

Login to write a write a comment.

Login via GitHub

Reviewers
Assignees
No one assigned
Labels
Milestone