llama.cpp
7de5c7ca
- CUDA: add set rows for f32 and f16 (#14551)
Go
Login via GitHub
Home
Pricing
FAQ
Install
Login
via GitHub
Commit
View On
GitHub
Commit
89 days ago
CUDA: add set rows for f32 and f16 (#14551) * CUDA: add set rows for f32 and f16 * Review: change kernel params, use strides from host * Use 1-d kernel * Review: use int64_t for blockDim.x, rename nb->s for clarity
References
#14551 - CUDA: add set rows for f32 and f16
Author
am17an
Parents
8eff9554
Loading