vllm
e84e0735 - fix: revert cast to cpu in `MsgpackEncoder._encode_tensor` to avoid hidden performance regressions (#25738)

Commit
138 days ago
fix: revert cast to cpu in `MsgpackEncoder._encode_tensor` to avoid hidden performance regressions (#25738)

Signed-off-by: Andrew Sansom <andrew@protopia.ai>