llama.cpp
llama : disable graph reuse when contexts share memory under SPLIT_MODE_TENSOR
#24549
Open

llama : disable graph reuse when contexts share memory under SPLIT_MODE_TENSOR #24549

nycdubliner
nycdubliner nycdubliner requested a review from ggerganov ggerganov 1 day ago
ggml-gh-bot
nycdubliner nycdubliner marked this pull request as draft 1 day ago
nycdubliner llama : disable graph reuse when contexts share memory under SPLIT_MO…
9432df67
nycdubliner nycdubliner force pushed from e0ba2c88 to 9432df67 1 day ago
nycdubliner
nycdubliner nycdubliner marked this pull request as ready for review 1 day ago
philpax
nycdubliner
nycdubliner

Login to write a write a comment.

Login via GitHub

Reviewers
Assignees
No one assigned
Labels
Milestone