vllm
e6327c9b
- [Feature] Support sequence parallelism for static fp8 quantization (#19181)
Go
Login via GitHub
Home
Pricing
FAQ
Install
Login
via GitHub
Commit
View On
GitHub
Commit
172 days ago
[Feature] Support sequence parallelism for static fp8 quantization (#19181) Signed-off-by: cascade812 <cascade812@outlook.com>
References
#19181 - [Feature] Support sequence parallelism for static fp8 quantization
Author
cascade812
Parents
d0132f02
Loading