llama.cpp
7538246e - cuda : add f32 to bf16 copy op (#12806)

Commit
252 days ago
cuda : add f32 to bf16 copy op (#12806) This allows BF16 KV-cache on CUDA.
Author
Parents
Loading