whisper.cpp
2d70cd36 - CUDA: optimize FA for GQA + large batches (llama/12014)

Commit
334 days ago
CUDA: optimize FA for GQA + large batches (llama/12014)
Committer
Parents
Loading