llama.cpp
71b69aa7 - cuda : fix flash_attn kernel to produce same results as CPU

Commit
1 year ago
cuda : fix flash_attn kernel to produce same results as CPU
Author
Committer
Parents
Loading