transformers
22ddccc4 - Attempt to use custom cuda kernels for speed up inference for bloom.

Commit
3 years ago
Attempt to use custom cuda kernels for speed up inference for bloom.
Author
Parents
Loading