llama.cpp
CUDA: manage NCCL communicators in context
#21891
Merged

CUDA: manage NCCL communicators in context #21891

JohannesGaessler
JohannesGaessler JohannesGaessler requested a review 38 days ago
JohannesGaessler JohannesGaessler requested a review from ggerganov ggerganov 38 days ago
JohannesGaessler
JohannesGaessler CUDA: manage NCCL communicators in context
3a396fd7
JohannesGaessler JohannesGaessler force pushed from fd1d82bd to 3a396fd7 38 days ago
JohannesGaessler JohannesGaessler changed the title CUDA: manage NCCL communictors in context CUDA: manage NCCL communicators in context 38 days ago
ggerganov
ggerganov commented on 2026-04-14
gaugarg-nv
gaugarg-nv
gaugarg-nv commented on 2026-04-14
JohannesGaessler add check that all backends are CUDA
d49b1c1e
JohannesGaessler remove unused vector, limit init to > 1 GPUs
2c43969c
JohannesGaessler JohannesGaessler force pushed from 20457274 to 2c43969c 38 days ago
github-actions github-actions added Nvidia GPU
github-actions github-actions added ggml
gaugarg-nv
JohannesGaessler fix warnings
db53263d
gaugarg-nv
gaugarg-nv commented on 2026-04-15
ggerganov
ggerganov approved these changes on 2026-04-15
JohannesGaessler fix cuda device, cache allreduce
8c78c5c2
ggerganov
ggerganov approved these changes on 2026-04-15
JohannesGaessler
am17an
am17an approved these changes on 2026-04-15
JohannesGaessler JohannesGaessler merged 014dca49 into master 37 days ago

Login to write a write a comment.

Login via GitHub

Assignees
No one assigned
Labels
Milestone