llama.cpp
CUDA: add head size 72 for flash-attn
#16962
Merged
Go
Login via GitHub
Home
Pricing
FAQ
Install
Login
via GitHub
Overview
Commits
1
Changes
View On
GitHub
Commits
CUDA: add head size 72
theo77186
committed
227 days ago
Loading