vllm
68c4421b
- [AMD][Quantization] Add TritonScaledMMLinearKernel since int8 is broken for AMD (#12282)
Go
Login via GitHub
Home
Pricing
FAQ
Install
Login
via GitHub
Commit
View On
GitHub
Commit
1 year ago
[AMD][Quantization] Add TritonScaledMMLinearKernel since int8 is broken for AMD (#12282) Signed-off-by: Randall Smith <Randall.Smith@amd.com>
References
#12282 - [AMD][Quantization] Add TritonScaledMMLinearKernel since int8 is broken for AMD
Author
rasmith
Parents
aea94362
Loading