llama.cpp
Fix CUDA softmax by subtracting max value before exp
#2665
Merged
