llama.cpp
Commit 71b69aa7: cuda : fix flash_attn kernel to produce same results as CPU
Date: 1 year ago
Author: ggerganov
Committer: ggerganov
Parents: fd878f71