vllm
f7e62e3d - [Bugfix] Fix mismatch between global and local attention heads in tensor-parallel mode for param2moe model (#39707)

Commit

20 days ago

[Bugfix] Fix mismatch between global and local attention heads in tensor-parallel mode for param2moe model (#39707)

Signed-off-by: bhargav-patel-29 <bhargav.patel@tihiitb.org>
Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>

References

#39707 - [Bugfix] Fix mismatch between global and local attention heads in tensor-parallel mode for param2moe model

Author

bhargav-patel-29

Parents

18b1c772
