llama.cpp
Grammar optimization: eliminate redundant grammar trees (~4x faster grammar sampling)
#6616
Merged
