lighteval
88e3a3bc - change tokenizer to pad to 'longest' sequence, instead of 'max_length' (#669)

Commit

309 days ago

change tokenizer to pad to 'longest' sequence, instead of 'max_length' (#669) otherwise, the response length is always 1 which is unexpected Co-authored-by: xgw <xinguang.wxg@alibaba-inc.com> Co-authored-by: Nathan Habib <30601243+NathanHB@users.noreply.github.com>

References

#669 - tokenizer 'padding' param is not correct.

Author

xgwang

Parents

5120c586

lighteval 88e3a3bc - change tokenizer to pad to 'longest' sequence, instead of 'max_length' (#669)

lighteval
88e3a3bc - change tokenizer to pad to 'longest' sequence, instead of 'max_length' (#669)