text-generation-inference
ad94f299 - feat: compile vllm for cuda after flash_attn
Commit (1 year ago)
feat: compile vllm for cuda after flash_attn
Author: drbh
Committer: drbh
Parents: 8253f830