text-generation-inference 44ce098c - feat(server): pre-allocate max attention mask (#75)
Commit
2 years ago
feat(server): pre-allocate max attention mask (#75)
References
#75 - feat(server): pre-allocate max attention mask
Author
OlivierDehaene
Parents
78063c05