vllm
8de2fd39
- deep_ep + use_fp8_dispatch
Go
Login via GitHub
Home
Pricing
FAQ
Install
Login
via GitHub
Commit
View On
GitHub
Commit
350 days ago
deep_ep + use_fp8_dispatch Signed-off-by: Varun Sundar Rabindranath <vsundarr@redhat.com>
References
#19885 - [EP+DP] Optimize the little operations in the DeepGEMM + DeepEP low latency case
Author
Varun Sundar Rabindranath
Parents
4c8f64fa
Loading