text-generation-inference
e86cca97 - Adding docs on how dynamic batching works. (#258)

Commit
2 years ago
Adding docs on how dynamic batching works. (#258) This PR starts the minimal possible amount of explanation I could think of. It tries to explain how dynamic batching occurs, the interactions with past key values and ignores the padding problem. Maybe some drawings could help too but I kept it to text for now.
Author
Parents
Loading