Add batched and strided batched gemm as TunableOp #13841
cloudhan
marked this pull request as ready for review 3 years ago
abudup
commented
on 2022-12-06
cloudhan
force pushed
from
e77b2e7d
to
5a18f264
3 years ago
abudup
commented
on 2022-12-14
cloudhan
force pushed
from
5a18f264
to
e4206e3b
3 years ago
cloudhan
force pushed
from
be7ef039
to
5ebd02ed
3 years ago
cloudhan
force pushed
from
5ebd02ed
to
cb1d1b89
3 years ago
abudup
dismissed these changes
on 2023-01-04
cloudhan
dismissed their stale review
via f46305a3
3 years ago
cloudhan
force pushed
from
cb1d1b89
to
f46305a3
3 years ago
abudup
dismissed these changes
on 2023-01-05
Add tunable strided batch gemm composed from rocblas and ck
28b52abe
Adjust gemm_test.py to be unified with strided_batched_gemm_test.py
438bc889
Add batched gemm tunable op
b24de039
Remove staled TODOs
51e339a9
Use tunable StridedBatchedGemm for MatMul
d488f104
Switch remaining rocblasGemmHelper usage in Gemm to tunable Gemm
4ee27326
Switch contrib ops to use tunable gemms
c412f5eb
Fix batched gemm for ke
94b45323
Add tests for alpha and beta
ec6bc368
Update after rebase onto main
c52bafdc
Fix size calculation
433f90b3
Fix typo
09c7e594
Factor out the `constexpr if`s
9bace1eb
Fix
af875bdc
Minor
eb34e43c
Use new API after rebase
7a6bf14f
Enable all ck tests after ck update
5b439ab5
Improve test speed for all gemms
19107853
cloudhan
dismissed their stale review
via 19107853
3 years ago
cloudhan
force pushed
from
f46305a3
to
19107853
3 years ago
cloudhan
merged
be879c11
into main 3 years ago
cloudhan
deleted the guangyunhan/more-tunable-gemm branch 3 years ago
Assignees
No one assigned
Login to write a write a comment.
Login via GitHub