CUDA: add set rows for f32 and f16 #14551
am17an
force pushed
271 days ago
am17an
force pushed
269 days ago
am17an
force pushed
269 days ago
CUDA: add set rows for f32 and f16
853bc5ec
Review: change kernel params, use strides from host
15e1b897
Use 1-d kernel
85e2a202
am17an
force pushed
to
85e2a202
266 days ago
Review: use int64_t for blockDim.x, rename nb->s for clarity
9deb7644
ggerganov
merged
7de5c7ca
into master 265 days ago
am17an
deleted the cuda_set_rows branch 164 days ago
Assignees
No one assigned
Labels
Nvidia GPU
examples
ggml
Login to write a write a comment.
Login via GitHub