llama.cpp
b061ba9e - llama : fix top-p sampling to match the canonical definition (#1953)

Commit
2 years ago
llama : fix top-p sampling to match the canonical definition (#1953) * Fix top-p sampling to match the standard definition (smallest set that has probability mass at least p, not largest set with probability mass less than p) * top-p: correct gt to gte * add test for correct top-p behavior
Author
Parents
Loading