llama.cpp
34c961b1 - CUDA/HIP: Fix fattn-vec-* when device warp size is not 32 (#12315)

Commit
179 days ago
CUDA/HIP: Fix fattn-vec-* when device warp size is not 32 (#12315) When fattn-wmma was ported over to warp64 various bits that also touch fattn-vec where converted to selectable warp size, however the fattn-vec kernels dont work with 64 wide warps for now, so we need to avoid launching them with parameters for warp64
Author
Parents
Loading