[Feature]: support serving nvfp4 W4A16 moe models uisng Marlin #30906
Add CompressedTensorsW4A16NVfp4MoeMethod to serve nvfp4 W4A16 quantiz…
ade34857
fix naming
419c1b32
add EPLB check
8f54548d
Merge branch 'main' into main
a148e770
fix method signitures
88494b62
align maybe_make_prepare_finalize inputs with the parent
051c97ef
Assignees
No one assigned
Login to write a write a comment.
Login via GitHub