text-generation-inference
343aa7a1 - fix: Handle concurrent grammar requests (#1610)

Commit
1 year ago
fix: Handle concurrent grammar requests (#1610) This PR fixes parallel grammar requests, currently grammar states are not concatenated correctly when a new request is added to the batch and this results in incorrect generation. This PR updates the `concatenate` function to correctly include the previous states. fixes: #1601
Author
Parents
Loading