vllm
[Model] Enable FP8 QKV in MoE and refine kernel tuning script
#5039
Merged

[Model] Enable FP8 QKV in MoE and refine kernel tuning script #5039

comaniac
comaniac wip
487c724d
comaniac a
8c06d5f2
mgoin
comaniac
comaniac script
b7cf8968
comaniac add 8x22b tp=8
a1c886f1
comaniac fix
0af621db
comaniac comaniac changed the title [Model] Enable FP8 QKV in MoE [Model] Enable FP8 QKV in MoE and refine kernel tuning script 1 year ago
comaniac finish tuning
81399b65
robertgshaw2-redhat
robertgshaw2-redhat approved these changes on 2024-05-31
robertgshaw2-redhat
LiuXiaoxuanPKU LiuXiaoxuanPKU merged e9899fb7 into main 1 year ago
comaniac comaniac deleted the moe-lienar branch 1 year ago

Login to write a write a comment.

Login via GitHub

Assignees
No one assigned
Labels
Milestone