llama.cpp
Clblast fixes + enhancements to save VRAM and offload more layers
#1675
Merged

0cc4m merged 12 commits into ggml-org:master from LostRuins:concedo-opencl-dev
0cc4m Use events instead of clFinish, where possible
ebc5d065
0cc4m OpenCL: Don't load gpu layers into RAM, add mul_f32 kernel
97c5cca4
0cc4m Reduce queueing overhead for contiguous tensors by using single mul k…
ac6b49ed
0cc4m Merge remote-tracking branch 'origin/master' into opencl-dev
49aaf083
0cc4m Adapt to #1612 cl_mem malloc changes
5e1eecfe
0cc4m Reduce code duplication between cuda and opencl branches
457aaf5b
0cc4m Improve implementation
24239f0d
LostRuins Clblast fixes + enhancements to save VRAM:
59fe1687
github-actions commented on 2023-06-02
YellowRoseCx
LostRuins Merge branch 'master' into concedo-opencl-dev
2b700749
LostRuins change max value size_t to use limits
64e3e745
github-actions commented on 2023-06-04
0cc4m commented on 2023-06-04
LostRuins removed flags from the CL pool malloc, apply code tidying suggestions.
f6431ded
LostRuins requested a review from 0cc4m 2 years ago
0cc4m requested changes on 2023-06-06
0cc4m Update ggml-opencl.cpp
b6dd367b
0cc4m requested a review from 0cc4m 2 years ago
0cc4m approved these changes on 2023-06-06
0cc4m merged d5b111f5 into master 2 years ago
LostRuins deleted the concedo-opencl-dev branch 2 years ago
