llama.cpp
CUDA: MMQ code deduplication + iquant support
#8495
Merged

JohannesGaessler
JohannesGaessler
github-actions added Nvidia GPU
github-actions added python
JohannesGaessler
JohannesGaessler added Review Complexity : High
JohannesGaessler
Nexesenex
JohannesGaessler
Green-Sky
slaren
JohannesGaessler
oldgithubman
JohannesGaessler
slaren
JohannesGaessler
slaren
slaren
slaren approved these changes on 2024-07-17
slaren
JohannesGaessler force pushed 1 year ago
slaren
Green-Sky
JohannesGaessler CUDA: MMQ code deduplication + iquant support
f0f71a5d
JohannesGaessler
JohannesGaessler force pushed to f0f71a5d 1 year ago
JohannesGaessler
Green-Sky
ggerganov
JohannesGaessler 1 less parallel job for CI build
0282b716
Green-Sky
ggerganov
github-actions added devops
Green-Sky
JohannesGaessler
slaren
ggerganov
JohannesGaessler merged 69c487f4 into master 1 year ago
JohannesGaessler
