llama.cpp
4c6744b5 - cuda : remove duplicated cuBLAS GEMM code

Commit

2 years ago

cuda : remove duplicated cuBLAS GEMM code

References

#3776 - cuda : improve text-generation and batched decoding performance

Author

ggerganov
