llama.cpp
speculative : PoC for speeding-up inference via speculative sampling
#2926
Merged

Loading