llama.cpp
Use fp32 in cuBLAS V100 to avoid overflows, env variables to override cuBLAS compute type
#19959
Merged

Use fp32 in cuBLAS V100 to avoid overflows, env variables to override cuBLAS compute type #19959

am17an merged 13 commits into ggml-org:master from forced_cublas_new
wallentri88
wallentri88 Update ggml-cuda.cu
d2237d39
wallentri88 Update ggml-cuda.cu
c5ec63ac
wallentri88
github-actions github-actions added Nvidia GPU
github-actions github-actions added ggml
wallentri88 Update build.md
b1939bd4
github-actions github-actions added documentation
wallentri88 Update build.md
5b51d4d6
JohannesGaessler
JohannesGaessler commented on 2026-03-02
wallentri88
wallentri88 Update ggml/src/ggml-cuda/ggml-cuda.cu
787710d0
JohannesGaessler
JohannesGaessler commented on 2026-03-03
wallentri88 Merge branch 'ggml-org:master' into forced_cublas_new
57641d55
wallentri88 Update ggml-cuda.cu
b84198c7
wallentri88 Update build.md
2e7693be
wallentri88 wallentri88 requested a review from JohannesGaessler JohannesGaessler 24 days ago
am17an
JohannesGaessler
JohannesGaessler approved these changes on 2026-03-04
ORippler
ORippler commented on 2026-03-04
ORippler
ORippler commented on 2026-03-04
wallentri88 Update ggml/src/ggml-cuda/ggml-cuda.cu
d8ff8ebe
wallentri88 Update build.md
ad27ec11
wallentri88
JohannesGaessler
wallentri88
wallentri88
am17an
wallentri88
am17an
JohannesGaessler
wallentri88 Merge branch 'ggml-org:master' into forced_cublas_new
808594df
wallentri88 Update ggml-cuda.cu
77a8e2a6
wallentri88 Update ggml-cuda.cu
d23aac8b
wallentri88
am17an am17an merged f2c0dfb7 into master 14 days ago
wallentri88 wallentri88 deleted the forced_cublas_new branch 13 days ago

Login to write a write a comment.

Login via GitHub

Assignees
No one assigned
Labels
Milestone