llama.cpp
7de5c7ca - CUDA: add set rows for f32 and f16 (#14551)

Commit
89 days ago
CUDA: add set rows for f32 and f16 (#14551)

* CUDA: add set rows for f32 and f16
* Review: change kernel params, use strides from host
* Use 1-d kernel
* Review: use int64_t for blockDim.x, rename nb->s for clarity
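
The commit message names the changes but does not show the kernel. As a rough illustration only, the following CPU sketch in C++ shows what a set-rows operation with flat 1-D indexing might look like, assuming "set rows" means copying source row r into destination row idx[r]. The function name `set_rows_ref`, its parameters, and the element-based strides are hypothetical and are not taken from the llama.cpp source; f16 handling is omitted.

```cpp
#include <cstdint>
#include <vector>

// Hypothetical CPU reference (not the llama.cpp kernel): copy source row r
// into destination row idx[r]. Row strides are given in elements and are
// supplied by the caller, loosely mirroring "use strides from host".
void set_rows_ref(const float* src, const int64_t* idx, float* dst,
                  int64_t n_rows, int64_t n_cols,
                  int64_t s_src_row, int64_t s_dst_row) {
    const int64_t n_total = n_rows * n_cols;
    // A single flat loop stands in for a 1-D kernel launch: i plays the role
    // of blockIdx.x * blockDim.x + threadIdx.x, kept in 64-bit arithmetic.
    for (int64_t i = 0; i < n_total; ++i) {
        const int64_t row = i / n_cols;  // which source row this element is in
        const int64_t col = i % n_cols;  // position within that row
        dst[idx[row] * s_dst_row + col] = src[row * s_src_row + col];
    }
}
```

With a 2x2 source and idx = {2, 0}, this sketch writes source row 0 into destination row 2 and source row 1 into destination row 0, leaving destination row 1 untouched. Keeping the flat index in `int64_t` avoids overflow when the element count exceeds the 32-bit range, which is one plausible reading of the last review note.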