llama.cpp
Recognize IBM Granite 3.3 FIM tokens. Makes llama-server /infill usable.
#12988
Merged
