vllm
[Bugfix][Kernel] Fix per-token/per-channel quantization for Hopper scaled mm
#12696
Merged

[Bugfix][Kernel] Fix per-token/per-channel quantization for Hopper scaled mm #12696

tlrmchlsmth
tlrmchlsmth Fix per-token/per-channel quantization for Hopper scaled mm
fd8f32fd
github-actions
mergify
mergify mergify added needs-rebase
tlrmchlsmth Merge branch 'main' into fix_cutlass_group_checks
be7888da
mergify mergify removed needs-rebase
tlrmchlsmth
tlrmchlsmth commented on 2025-02-03
tlrmchlsmth tlrmchlsmth marked this pull request as ready for review 318 days ago
robertgshaw2-redhat
robertgshaw2-redhat approved these changes on 2025-02-03
LucasWilkinson
LucasWilkinson approved these changes on 2025-02-03
tlrmchlsmth tlrmchlsmth added ready
mgoin
mgoin approved these changes on 2025-02-03
simon-mo simon-mo merged c11de33d into main 318 days ago

Login to write a write a comment.

Login via GitHub

Assignees
No one assigned
Labels
Milestone