llama.cpp
27208bf6 - CUDA: add bf16 and f32 support to cublas_mul_mat_batched (#14361)

Commit

77 days ago

CUDA: add bf16 and f32 support to cublas_mul_mat_batched (#14361) * CUDA: add bf16 and f32 support to cublas_mul_mat_batched * Review: add type traits and make function more generic * Review: make check more explicit, add back comments, and fix formatting * Review: fix formatting, remove useless type conversion, fix naming for bools

References

#14361 - CUDA: add bf16 and f32 support to cublas_mul_mat_batched

Author

am17an

Parents

63a7bb3c

llama.cpp 27208bf6 - CUDA: add bf16 and f32 support to cublas_mul_mat_batched (#14361)

llama.cpp
27208bf6 - CUDA: add bf16 and f32 support to cublas_mul_mat_batched (#14361)