llama.cpp
llama : disable graph reuse when contexts share memory under SPLIT_MODE_TENSOR
#24549
Open

nycdubliner
nycdubliner requested a review from ggerganov 12 days ago
nycdubliner marked this pull request as draft 12 days ago
nycdubliner llama : disable graph reuse when contexts share memory under SPLIT_MO…
9432df67
nycdubliner force pushed from e0ba2c88 to 9432df67 12 days ago
nycdubliner marked this pull request as ready for review 12 days ago
