llama.cpp
fb19f94c
- TP: fix 0-sized tensor slices, AllReduce fallback (#21808)
Go
Login via GitHub
Home
Pricing
FAQ
Install
Login
via GitHub
Commit
View On
GitHub
Commit
45 days ago
TP: fix 0-sized tensor slices, AllReduce fallback (#21808) * TP: fix 0-sized tensor slices, AllReduce fallback * fix layer structure <-> GPU count aliasing * add missing std::fill * fix CUDA device set, max ggml ctx size
References
#21808 - TP: fix 0-sized tensor slices, AllReduce fallback
Author
JohannesGaessler
Parents
7f251fdb
Loading