llama.cpp
34c961b1 - CUDA/HIP: Fix fattn-vec-* when device warp size is not 32 (#12315)

Commit

298 days ago

CUDA/HIP: Fix fattn-vec-* when device warp size is not 32 (#12315) When fattn-wmma was ported over to warp64 various bits that also touch fattn-vec where converted to selectable warp size, however the fattn-vec kernels dont work with 64 wide warps for now, so we need to avoid launching them with parameters for warp64

References

#12315 - CUDA/HIP: Fix fattn-vec-* when device warp size is not 32

Author

IMbackK

Parents

7841fc72

llama.cpp 34c961b1 - CUDA/HIP: Fix fattn-vec-* when device warp size is not 32 (#12315)

llama.cpp
34c961b1 - CUDA/HIP: Fix fattn-vec-* when device warp size is not 32 (#12315)