llama.cpp
b37124d2
- vulkan: handle quantize_q8_1 overflowing the max workgroup count (#18515)
Go
Login via GitHub
Home
Pricing
FAQ
Install
Login
via GitHub
Commit
View On
GitHub
Commit
27 days ago
vulkan: handle quantize_q8_1 overflowing the max workgroup count (#18515) * vulkan: handle quantize_q8_1 overflowing the max workgroup count * vulkan: Fix small tile size matmul on lavapipe * fix mul_mat_id failures
References
#18515 - vulkan: handle quantize_q8_1 overflowing the max workgroup count
Author
jeffbolznv
Parents
eadc4184
Loading