ggml
CUDA/HIP: honor GGML_PREC_F32 in the flash-attention tile kernel
#1536
Open

CUDA/HIP: honor GGML_PREC_F32 in the flash-attention tile kernel #1536

RapidMark
CUDA/HIP: honor GGML_PREC_F32 in the flash-attention tile kernel
7ba34e6d

Login to write a write a comment.

Login via GitHub

Reviewers
No reviews
Assignees
No one assigned
Labels
Milestone