[Bugfix][Kernel] Fix per-token/per-channel quantization for Hopper scaled mm #12696
Fix per-token/per-channel quantization for Hopper scaled mm
fd8f32fd
Merge branch 'main' into fix_cutlass_group_checks
be7888da
tlrmchlsmth marked this pull request as ready for review 318 days ago
mgoin approved these changes on 2025-02-03
simon-mo merged c11de33d into main 318 days ago