llama.cpp
9dba9f53 - (Bugfix, ggml-cuda) Pool alloc count fix + small size computation type adjustment (#18559)

Commit

55 days ago

(Bugfix, ggml-cuda) Pool alloc count fix + small size computation type adjustment (#18559) * CUDA: Fixed obj byte size instead of obj count being passed to pool alloc (fattn-common, dst_tmp_meta) * CUDA: Explicitly casted some of the int alloc counts before multiplication in argsort --------- Co-authored-by: pl752 <maximpl752@gmail.com>

References

#18559 - (Bugfix, ggml-cuda) Pool alloc count fix + small size computation type adjustment

Author

pl752

Parents

bcfc8c3c

llama.cpp 9dba9f53 - (Bugfix, ggml-cuda) Pool alloc count fix + small size computation type adjustment (#18559)

llama.cpp
9dba9f53 - (Bugfix, ggml-cuda) Pool alloc count fix + small size computation type adjustment (#18559)