llama.cpp
55a1c5a5 - CUDA: add softmax broadcast (#14475)

Commit
202 days ago
CUDA: add softmax broadcast (#14475) * CUDA: add softmax broadcast * Pass by const ref * Review: Use blockDims for indexing, remove designated initializers * Add TODO for noncontigous input/output
Author
Committer
Parents
Loading