text-generation-inference
a6c18c39
- feat(server): use cuda graph in logits warping (#302)
Go
Login via GitHub
Home
Pricing
FAQ
Install
Login
via GitHub
Commit
View On
GitHub
Commit
2 years ago
feat(server): use cuda graph in logits warping (#302)
References
#302 - feat(server): use cuda graph in logits warping
Author
OlivierDehaene
Parents
35ab6cfc
Loading