llama.cpp
CUDA: add head size 72 for flash-attn
#16962
Merged

Commits
  • CUDA: add head size 72
    theo77186 committed 227 days ago
Loading