llama.cpp
fb19f94c - TP: fix 0-sized tensor slices, AllReduce fallback (#21808)

Commit

45 days ago

TP: fix 0-sized tensor slices, AllReduce fallback (#21808) * TP: fix 0-sized tensor slices, AllReduce fallback * fix layer structure <-> GPU count aliasing * add missing std::fill * fix CUDA device set, max ggml ctx size

References

#21808 - TP: fix 0-sized tensor slices, AllReduce fallback

Author

JohannesGaessler

Parents

7f251fdb

llama.cpp fb19f94c - TP: fix 0-sized tensor slices, AllReduce fallback (#21808)

llama.cpp
fb19f94c - TP: fix 0-sized tensor slices, AllReduce fallback (#21808)