llama.cpp
625a699b - `ggml_cuda_cpy` support for 4d tensors and float16->float32 upcasting (ggml/686)

Committed 1 year ago
* added CUDA float16->float32 upcasting to `ggml_cuda_cpy`
* added the ability to copy 4d tensors with the CUDA backend
* added tests for the float16->float32 upcast and for 4d tensor CUDA copies
* added a 4d copy test for the float32->float16 copy
* applied patch suggested by @iamlemec
* simplified the cpy tests

Co-authored-by: slaren <slarengh@gmail.com>
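To illustrate what the two changes amount to, here is a minimal CPU-side sketch (not the actual ggml CUDA kernel): a binary16-to-binary32 bit conversion, which is what `__half2float` performs per element on the device, plus a 4d strided copy using ggml's byte-stride (`nb`) convention. The function names `half_to_float` and `cpy_f16_f32_4d` are hypothetical, chosen only for this example.

```cpp
#include <cstdint>
#include <cstring>

// Hypothetical CPU analogue of the device-side upcast: decode IEEE 754
// binary16 bits into a float (sign, exponent, mantissa), handling
// zeros, subnormals, and inf/NaN.
static float half_to_float(uint16_t h) {
    uint32_t sign = (uint32_t)(h >> 15) << 31;
    uint32_t exp  = (h >> 10) & 0x1F;
    uint32_t mant = h & 0x3FF;
    uint32_t bits;
    if (exp == 0) {
        if (mant == 0) {
            bits = sign;                              // +/- zero
        } else {
            // subnormal half: renormalize the mantissa for binary32
            int e = -1;
            do { mant <<= 1; e++; } while ((mant & 0x400) == 0);
            bits = sign | ((uint32_t)(127 - 15 - e) << 23) | ((mant & 0x3FF) << 13);
        }
    } else if (exp == 0x1F) {
        bits = sign | 0x7F800000 | (mant << 13);      // inf / NaN
    } else {
        bits = sign | ((exp - 15 + 127) << 23) | (mant << 13);
    }
    float f;
    std::memcpy(&f, &bits, sizeof(f));
    return f;
}

// Hypothetical 4d copy with upcast: ne[i] are element counts, nb[i] are
// byte strides per dimension (ggml convention). A CUDA kernel would
// derive i0..i3 from its flat thread index; here plain loops suffice.
static void cpy_f16_f32_4d(const uint8_t * src, float * dst,
                           const int64_t ne[4], const size_t nb[4]) {
    int64_t idx = 0;
    for (int64_t i3 = 0; i3 < ne[3]; ++i3)
    for (int64_t i2 = 0; i2 < ne[2]; ++i2)
    for (int64_t i1 = 0; i1 < ne[1]; ++i1)
    for (int64_t i0 = 0; i0 < ne[0]; ++i0) {
        uint16_t h;
        std::memcpy(&h, src + i0*nb[0] + i1*nb[1] + i2*nb[2] + i3*nb[3], sizeof(h));
        dst[idx++] = half_to_float(h);
    }
}
```

Because the copy walks the source through its byte strides, the same loop works for non-contiguous views; the real kernel additionally takes the destination's strides so it can scatter into non-contiguous outputs as well.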