transformers
4391cfd3 - perf: Optimization for Min-p sampling implementation (#42248)

Commit
31 days ago
perf: Optimization for Min-p sampling implementation (#42248) * refactor(MinPLogitsWarper): optimizing min_tokens_to_keep * Fix(MinPLogitsWarper): edge case when min_tokens_to_keep > vocab_size
Author
Parents
Loading