fix nvfp act quantization bug #891
fix nvfp act quantization bug
211c2fbb
add cuda ut for moe nvfp quantize
68a38d78
add cpu UT, refine cuda UT
9c94b04b
[pre-commit.ci] auto fixes from pre-commit.com hooks
19f09d54
fix ut typo
2f32fec1
[pre-commit.ci] auto fixes from pre-commit.com hooks
59e184e9
fix cpu ut
2dd7a697
fix typo
cb70d6d6
Merge branch 'main' into fix_nvfp_qact_bug
356f2edf
enhance experts amax match, refine UT
ffa023d5
[pre-commit.ci] auto fixes from pre-commit.com hooks
cf26c2bc
WeiweiZhang1
deleted the fix_nvfp_qact_bug branch 149 days ago