llama.cpp
197c0068 - Allow multiple copy function pointers for CUDA graph kernel param updates (#7565)

Commit

1 year ago

Allow multiple copy function pointers for CUDA graph kernel param updates (#7565) CUDA graphs require parameter updates to kernels associated with GGML_OP_CPY nodes. Previously the implementation only checked for a single CUDA kernel in such nodes, but this caused a bug in cases where 2 such kernels exist. This fixes the issue by using a vector to allow multiple function pointers to be stored and checked against. Fixes #7942

References

#7565 - Allow multiple copy function pointers for CUDA graph kernel updates

Author

agray3

Parents

95f84d5c

llama.cpp 197c0068 - Allow multiple copy function pointers for CUDA graph kernel param updates (#7565)

llama.cpp
197c0068 - Allow multiple copy function pointers for CUDA graph kernel param updates (#7565)