onnxruntime
68c832d5 - Fix buffer overrun in 4b dequant cuda (#18780)

Commit
2 years ago
Fix buffer overrun in 4b dequant cuda (#18780) ### Description Bugfix: Dequantize4BitsKernel buffer overrun when the input matrix has less than the number of blocks that a single thread block can handle. ### Motivation and Context <!-- - Why is this change required? What problem does it solve? - If it fixes an open issue, please link to the issue here. -->
Author
Parents
Loading