llama.cpp
CUDA Graph Compute Function Refactor (precursor for performance improvements)
#11042
Merged

CUDA Graph Compute Function Refactor (precursor for performance improvements) #11042

aendk
github-actions github-actions added Nvidia GPU
github-actions github-actions added ggml
aendk aendk marked this pull request as draft 282 days ago
aendk
ggerganov
aendk aendk force pushed from 3998c0dc to 004ec3ad 277 days ago
aendk Refactor: Moves cuda graph executable update step to separate function.
ba053310
aendk Refactor: Moves cuda graph update check to separate function.
22c24294
aendk Refactor: Moves cuda graph maintenance (update or adjusting copy para…
eb3ea698
aendk Fix: Adds missing reference to maintain_cuda_graph() definition.
ed10ff58
aendk Refactor: Improves structure and abstractions by moving CUDA graph ev…
37518b7d
aendk Refactor: Moves node graph checks and copy ops into individual functi…
0cdc1339
aendk Refactor: Removes code permanently excluded from compilation to incre…
dd95edfc
aendk Style: Adds missing newline
98d4e55f
aendk aendk force pushed from 004ec3ad to 98d4e55f 275 days ago
aendk aendk marked this pull request as ready for review 275 days ago
aendk
slaren
slaren commented on 2025-01-10
aendk Style: Consolidates several neighboring '#ifdef USE_CUDA_GRAPH' into …
fcd62d9d
aendk Refactor: Makes 'cuda_graph_update_required' a local variable
62f2f624
aendk
slaren
slaren approved these changes on 2025-01-13
slaren remove double lines between functions
5226732f
slaren slaren merged 39509fb0 into master 271 days ago

Login to write a write a comment.

Login via GitHub

Reviewers
Assignees
No one assigned
Labels
Milestone