vllm
e9899fb7
- [Model] Enable FP8 QKV in MoE and refine kernel tuning script (#5039)
Commit
1 year ago
References
#5039 - [Model] Enable FP8 QKV in MoE and refine kernel tuning script
Author
comaniac
Parents
a377f0bd