vllm
a372f3f4 - [MISC] Fix Tensor Parallelism for Quantized Mamba Models with n_groups=1 (#33257)

Commit
22 days ago
[MISC] Fix Tensor Parallelism for Quantized Mamba Models with n_groups=1 (#33257) Signed-off-by: Vadim Gimpelson <vadim.gimpelson@gmail.com>
Author
Parents
Loading