transformers
Continuous batching refactor
#40426
Merged

ArthurZucker merged 27 commits into huggingface:main from remi-or:conbat
remi-or
remi-or force pushed from f6e2c494 to b179844e 116 days ago
HuggingFaceDocBuilderDev
ArthurZucker approved these changes on 2025-08-25
ArthurZucker approved these changes on 2025-08-26
remi-or Rework of the CB example
9495e0e1
remi-or Further rework of CB example
cce99f07
remi-or Refactor PA cache, slice on tokens, add debug prints -- WIP
74a0d73d
remi-or Slice cache -- WIP
79118c53
remi-or Added a mechanism to check batched outputs in CB script
dc53ad60
remi-or Less logging, debug flag for slice, !better reset! -- WIP
b107785c
remi-or QOL and safety margins
bababa46
remi-or Refactor and style
f01e9db4
remi-or Better saving of cb example
3cffe20e
remi-or Fix
7cd70ac1
remi-or Fixes and QOL
2933099b
remi-or More information about metrics
bfcf6117
remi-or Further logging
f000b17d
remi-or Style
042e87dd
remi-or Licenses
604fe6e5
remi-or Removed some comments
ef635477
remi-or Add a slice input flag
d403b02f
remi-or Fix in example
023774fd
remi-or Added back some open-telemetry deps
c327f08d
remi-or Removed some aux function
173b497a
remi-or Added FA2 option to example script
fff2ee8a
remi-or Fixed math (all of it)
7353aef1
remi-or Added a simple example
0de06e30
remi-or Renamed core to classes
8325b376
remi-or Made allocation of attention mask optional
7dee44e4
remi-or Style
3f17daf8
remi-or force pushed from c7b820e1 to 3f17daf8 116 days ago
remi-or Merge branch 'main' into conbat
463ea913
ArthurZucker merged 34108a22 into main 116 days ago
