vllm
604b9eae - [BUGFIX] Fix accuracy regression for NVIDIA-Nemotron-3-Nano-30B-A3B-NVFP4 with TP>1 (#34476)

Commit
6 days ago
[BUGFIX] Fix accuracy regression for NVIDIA-Nemotron-3-Nano-30B-A3B-NVFP4 with TP>1 (#34476)

Signed-off-by: Vadim Gimpelson <vadim.gimpelson@gmail.com>