llama.cpp
CPU/CUDA: fix GQA mul mat back, add CUDA support
#11380
Merged

JohannesGaessler
github-actions added the testing label
github-actions added the Nvidia GPU label
github-actions added the ggml label
ggerganov
JohannesGaessler
ggerganov
JohannesGaessler CPU/CUDA: fix (GQA) mul mat back, add CUDA support
ae4cca3e
JohannesGaessler force-pushed to ae4cca3e 1 year ago
JohannesGaessler
ggerganov
ggerganov approved these changes on 2025-01-24
JohannesGaessler merged 8137b4bb into master 1 year ago
