llama.cpp
02d69881 - Improve cuBLAS performance by dequantizing on the GPU (#1065)

Commit

2 years ago

Improve cuBLAS performance by dequantizing on the GPU (#1065)

References

#1065 - Improve cuBLAS performance by dequantizing on the GPU

Author

slaren

slaren

Parents

Loading