onnxruntime
Adding a sm80 q4 gemm kernel for small tiles
#20545
Merged

Adding a sm80 q4 gemm kernel for small tiles #20545

chenfucn
chenfucn small tile q4 gemm kernel for am80
dd6134a2
chenfucn add test
8f7f0d5f
github-advanced-security
github-advanced-security commented on 2024-05-02
chenfucn add compilation guard to sm80 specific instructions
9bf0f280
chenfucn skip lint in code relies on cutlass styling
71e8a357
chenfucn suppress 0 size array warning
733da870
chenfucn lint
2527bf9d
yufenglee
yufenglee approved these changes on 2024-06-05
chenfucn chenfucn merged 6fb09055 into main 1 year ago
chenfucn chenfucn deleted the cfu_small_q4gemm branch 1 year ago
yufenglee
chenfucn

Login to write a write a comment.

Login via GitHub

Assignees
No one assigned
Labels
Milestone