llama.cpp
66d65ec2
- cuda: cap grid.y at 65535 in non-contiguous dequantize/convert kernels (#19999)
Go
Login via GitHub
Home
Pricing
FAQ
Install
Login
via GitHub
Commit
View On
GitHub
Commit
2 days ago
cuda: cap grid.y at 65535 in non-contiguous dequantize/convert kernels (#19999)
References
#19999 - cuda: fix grid.y overflow in non-contiguous dequantize/convert kernels
Author
oobabooga
Parents
05728db1
Loading