vllm
3c068c63
- [Kernel] Faster pre-processing time for W4A8 (#23972)
Go
Login via GitHub
Home
Pricing
FAQ
Install
Login
via GitHub
Commit
View On
GitHub
Commit
90 days ago
[Kernel] Faster pre-processing time for W4A8 (#23972) Signed-off-by: czhu-cohere <conway.zhu@cohere.com>
References
#23972 - [Kernel] Faster pre-processing time for W4A8
Author
czhu-cohere
Parents
f20c3b09
Loading