llama.cpp
CUDA: loop over ne2*ne3 in case it overflows
#19538
Merged

CUDA: loop over ne2*ne3 in case it overflows #19538

am17an merged 2 commits into ggml-org:master from am17an:convert-cublas-fix
am17an
am17an CUDA: loop over ne2*ne3 in case it overflows
3b93d390
github-actions github-actions added Nvidia GPU
github-actions github-actions added ggml
JohannesGaessler
JohannesGaessler commented on 2026-02-12
am17an use fastdiv
0fd79e57
JohannesGaessler
am17an am17an marked this pull request as ready for review 125 days ago
JohannesGaessler
JohannesGaessler approved these changes on 2026-02-12
am17an am17an merged 5065da55 into master 124 days ago
am17an am17an deleted the convert-cublas-fix branch 124 days ago

Login to write a write a comment.

Login via GitHub

Reviewers
Assignees
No one assigned
Labels
Milestone