[Kernel] port sgl moe_align_block_size kernels (#12574)

sgl_moe_align_block_size is based on:
https://github.com/sgl-project/sglang/commit/ded9fcd09a43d5e7d5bb31a2bc3e9fc21bf65d2a
moe_align_block_size is based on:
https://github.com/sgl-project/sglang/commit/ba5112ff691d791a9e38c6c71f59324a5fcb49d0

Signed-off-by: Yang Chen <yangche@fb.com>