[AMD][Quantization] Add TritonScaledMMLinearKernel since int8 is broken for AMD #12282
TritonScaledMMLinearKernel implementation
9e8bad6c
mgoin approved these changes on 2025-01-21
Add regression test for rocm w8a8
daf9a719
remote unused import
9c11d5c1
ruff
4e4d633e
mgoin approved these changes on 2025-01-22
mgoin enabled auto-merge (squash) 325 days ago
mgoin merged 68c4421b into main 324 days ago
Assignees
No one assigned
Labels
quantization
ready