llama.cpp
58970a4c - Leverage mmap for offloading tensors to GPU (#1597)

Commit

2 years ago

Leverage mmap for offloading tensors to GPU (#1597) * Rebase to latest * Show progress * Add assert to make sure we only allocate temp buffer for non-CPU backend tensor Co-authored-by: Johannes Gäßler <johannesg@5d6.de> --------- Co-authored-by: Johannes Gäßler <johannesg@5d6.de>

References

#1597 - Leverage mmap to offloading the tensors

Author

howard0su

Parents

8c0a10e6

llama.cpp 58970a4c - Leverage mmap for offloading tensors to GPU (#1597)

llama.cpp
58970a4c - Leverage mmap for offloading tensors to GPU (#1597)