llama.cpp
38355c6c - CUDA: use registers instead of smem in topk-moe (#16647)

Commit
58 days ago
CUDA: use registers instead of smem in topk-moe (#16647) Uses the technique used in the vulkan PR #16641. Neat trick!
Author
Parents
Loading