llama.cpp
CUDA: add FP32 FlashAttention vector kernel
#7188
Merged
Go
Login via GitHub
Home
Pricing
FAQ
Install
Login
via GitHub
Overview
Commits
4
Changes
View On
GitHub
Commits
CUDA: add FP32 FlashAttention vector kernel
JohannesGaessler
committed
2 years ago
fixup! CUDA: add FP32 FlashAttention vector kernel
JohannesGaessler
committed
2 years ago
fixup! fixup! CUDA: add FP32 FlashAttention vector kernel
JohannesGaessler
committed
2 years ago
fixup! fixup! fixup! CUDA: add FP32 FlashAttention vector kernel
JohannesGaessler
committed
2 years ago
Loading