llama.cpp
Leverage mmap to offloading the tensors
#1597
Merged

howard0su
JohannesGaessler
JohannesGaessler
JohannesGaessler commented on 2023-05-26
JohannesGaessler
howard0su
howard0su
JohannesGaessler
SlyEcho
howard0su
howard0su
JohannesGaessler
JohannesGaessler
github-actions
github-actions commented on 2023-05-27
howard0su changed the title from "Leverage mmap for CUDA loading" to "Leverage mmap to offloading the tensors" 2 years ago
howard0su force pushed 2 years ago
howard0su force pushed 2 years ago
howard0su requested a review from JohannesGaessler 2 years ago
JohannesGaessler
JohannesGaessler commented on 2023-05-30
howard0su
JohannesGaessler
SlyEcho
howard0su force pushed 2 years ago
howard0su
0cc4m
ggerganov
JohannesGaessler
JohannesGaessler
JohannesGaessler commented on 2023-06-10
JohannesGaessler
JohannesGaessler commented on 2023-06-10
howard0su
howard0su Rebase to latest
921d87ca
howard0su force pushed to 921d87ca 2 years ago
howard0su Show progress
34ca572e
howard0su requested a review from JohannesGaessler 2 years ago
howard0su requested a review from 0cc4m 2 years ago
howard0su
howard0su commented on 2023-06-11
JohannesGaessler
JohannesGaessler commented on 2023-06-11
howard0su Add assert to make sure we only allocate temp buffer for non-CPU back…
61726bd9
JohannesGaessler
JohannesGaessler
JohannesGaessler approved these changes on 2023-06-12
JohannesGaessler merged 58970a4c into master 2 years ago
