Change tokenizer to pad to the 'longest' sequence instead of 'max_length' (#669)
Otherwise, the response length is always 1, which is unexpected.
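
A minimal sketch of the difference between the two padding strategies, in plain Python so it runs without any tokenizer. The names `pad_batch`, `response_budget`, `PAD_ID`, and `MAX_LENGTH` are hypothetical and not taken from the codebase. The commit does not say how the response length is derived; the budget formula below (room left after the padded context, with a floor of 1) is an assumption used only to illustrate how padding every input to 'max_length' could leave a single token for the response.

```python
from typing import List

PAD_ID = 0  # hypothetical pad token id


def pad_batch(batch: List[List[int]], padding: str, max_length: int) -> List[List[int]]:
    """Right-pad token id lists, mimicking the 'longest' and 'max_length' strategies."""
    if padding == "longest":
        # Pad only up to the longest sequence in this batch.
        target = max(len(seq) for seq in batch)
    elif padding == "max_length":
        # Pad every sequence up to the full max_length.
        target = max_length
    else:
        raise ValueError(f"unknown padding strategy: {padding}")
    return [seq + [PAD_ID] * (target - len(seq)) for seq in batch]


def response_budget(padded_len: int, max_length: int) -> int:
    # ASSUMPTION (not from the commit): the response may use whatever room
    # is left after the padded context, and never fewer than 1 token.
    return max(1, max_length - padded_len)


batch = [[11, 12, 13], [21, 22, 23, 24, 25]]
MAX_LENGTH = 16

for strategy in ("max_length", "longest"):
    padded = pad_batch(batch, strategy, MAX_LENGTH)
    width = len(padded[0])
    print(strategy, width, response_budget(width, MAX_LENGTH))
# prints:
# max_length 16 1
# longest 5 11
```

Under that assumption, 'max_length' padding fills the whole window and leaves a budget of 1, while 'longest' padding keeps the padded context at the length of the longest input in the batch.
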
Co-authored-by: xgw <xinguang.wxg@alibaba-inc.com>
Co-authored-by: Nathan Habib <30601243+NathanHB@users.noreply.github.com>