llama.cpp
8faa1d4d - CUDA: faster non-contiguous concat (#10760)

Commit
327 days ago
CUDA: faster non-contiguous concat (#10760)

* faster uncontiguous concat

* Use a lambda to avoid code duplication

Co-authored-by: Diego Devesa <slarengh@gmail.com>

* Update ggml/src/ggml-cuda/concat.cu

* add constexpr and static assert

---------

Co-authored-by: Diego Devesa <slarengh@gmail.com>
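
For readers unfamiliar with the terms in the message, the following is a minimal host-side C++ sketch of a concat over non-contiguous inputs. It is not the code from ggml/src/ggml-cuda/concat.cu and makes no claim about how the commit achieves its speedup. The names `View2D`, `concat_f32`, and `load` are hypothetical, and the 2-D restriction is an assumption made to keep the example short. It only illustrates the three ideas the message mentions: addressing elements through byte strides, one lambda shared by both sources, and a compile-time dimension guarded by `constexpr` and `static_assert`.

```cpp
#include <cstdint>
#include <cstring>

// Hypothetical 2-D tensor view (illustration only, not a ggml type).
// ne* are element counts, nb* are byte strides, so the data need not be contiguous.
struct View2D {
    const char *data;
    int64_t ne0, ne1;  // elements along dim 0 and dim 1
    int64_t nb0, nb1;  // byte stride between neighbours along dim 0 and dim 1
};

// Concatenate two float views along Dim into a contiguous, row-major dst.
// dst must hold (a.ne0 + b.ne0) * a.ne1 floats for Dim == 0,
// or a.ne0 * (a.ne1 + b.ne1) floats for Dim == 1.
template <int Dim>
void concat_f32(const View2D &a, const View2D &b, float *dst) {
    static_assert(Dim == 0 || Dim == 1, "this sketch only handles 2-D views");
    constexpr bool along0 = (Dim == 0);

    const int64_t ne0   = along0 ? a.ne0 + b.ne0 : a.ne0;
    const int64_t ne1   = along0 ? a.ne1 : a.ne1 + b.ne1;
    const int64_t split = along0 ? a.ne0 : a.ne1;  // first index that belongs to b

    // One lambda reads from either source, so the strided addressing is written once.
    auto load = [](const View2D &v, int64_t i0, int64_t i1) {
        float x;
        std::memcpy(&x, v.data + i0 * v.nb0 + i1 * v.nb1, sizeof x);
        return x;
    };

    for (int64_t i1 = 0; i1 < ne1; ++i1) {
        for (int64_t i0 = 0; i0 < ne0; ++i0) {
            const int64_t i = along0 ? i0 : i1;  // position along the concat dim
            float x;
            if (i < split) {
                x = load(a, i0, i1);
            } else if (along0) {
                x = load(b, i0 - split, i1);
            } else {
                x = load(b, i0, i1 - split);
            }
            dst[i1 * ne0 + i0] = x;
        }
    }
}
```

A transposed view is a typical non-contiguous input: it keeps the original buffer and swaps the two strides, which is why the sketch reads every element through `nb0` and `nb1` instead of assuming a packed layout.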