[Paged-Attention] Handle continuous batching for repetition penalty #39457
Handle continuous batching for repetition penalty
635bc875
fix last scores and with token mask creation
84cf493f
add test
e34f1f49
Merge branch 'main' into penalty-cb
0cfcee20
Merge branch 'main' into penalty-cb
622e68bf
Update src/transformers/generation/continuous_batching.py
4e9cc2f6
Update src/transformers/generation/logits_process.py
2393d7b1
fix formatting
57aa84b9
kashif
commented
on 2025-07-22
remove unneeded cast
0909e421
Assignees
No one assigned
Login to write a write a comment.
Login via GitHub