llama.cpp
71b69aa7 - cuda : fix flash_attn kernel to produce same results as CPU

Commit
2 years ago
cuda : fix flash_attn kernel to produce same results as CPU
Author
Committer
Parents
Loading