[CB] Changes for long generation #45530
Fix KV dedup for decode batches
1c076c60
Fix memory estimation
7eec987f
Change default
b4b74ff0
Added write-only fast path
88287d19
Take both peaks into account
dd00e9bd
Revert unused config field
1599e249
Review 1
7cd08da7
Fix p1s
86406269
Fix p2s and p3s that needed it
b853c854
Added a TODO
f39b68f5
Fix test, lower max cached graph, add TODO
4f814b2b
Fix fragmentation with big warmup
28ca9ed3
Add more space for logits processors
878f469d
Merge branch 'main' into cb-very-long-gen
5854ad11
remi-or
marked this pull request as ready for review 54 days ago
Merge branch 'main' into cb-very-long-gen
7bd12b33
Fix
1695d375
Merge branch 'main' into cb-very-long-gen
6b45f178
ArthurZucker
deleted the cb-very-long-gen branch 52 days ago
Assignees
No one assigned
Login to write a write a comment.
Login via GitHub