llama.cpp
CUDA: tighter VRAM scratch size for 65b/70b
#2551
Merged

CUDA: tighter VRAM scratch size for 65b/70b #2551

JohannesGaessler
JohannesGaessler CUDA: tighter VRAM scratch size for 65b/70b
5d8b7659
ggerganov
ggerganov approved these changes on 2023-08-08
JohannesGaessler JohannesGaessler merged acfc5478 into master 2 years ago

Login to write a write a comment.

Login via GitHub

Reviewers
Assignees
No one assigned
Labels
Milestone