llama.cpp 8d8d54f8 - ggml : skip nops in compute_forward
Commit
1 year ago
ggml : skip nops in compute_forward
References
#3749 - cuda : add batched cuBLAS GEMM for faster attention
Author
ggerganov
Parents
6a30bf3e