Support llamav27b training (#1924)
Summary:
Training should work fine on A100, so adding a skip for A10G.
EDIT: Nah, training OOMs. I might be able to reduce the input length for training, though, to reduce activation memory.
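For context on why shorter inputs help: transformer activation memory grows roughly linearly with sequence length for the MLP/hidden activations and quadratically for the attention score matrices. A back-of-envelope sketch (all sizes here are illustrative placeholders, not the actual benchmark's numbers):

```python
def activation_bytes(seq_len, batch=1, hidden=4096, layers=32,
                     heads=32, bytes_per=2):
    # Hypothetical estimate: a handful of hidden-sized activation
    # tensors saved per token per layer (linear in seq_len).
    per_layer = 10 * batch * seq_len * hidden * bytes_per
    # Attention score matrices are (heads, seq_len, seq_len):
    # quadratic in seq_len.
    attn = batch * heads * seq_len * seq_len * bytes_per
    return layers * (per_layer + attn)

# Halving or quartering the input length shrinks the activation
# footprint by well more than the same factor once attention dominates.
ratio = activation_bytes(4096) / activation_bytes(1024)
print(f"4096-token vs 1024-token activation memory: {ratio:.1f}x")
```

Under these (made-up) dimensions the ratio lands between the 4x linear term and the 16x quadratic term, which is why trimming input length is a plausible OOM workaround.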
Pull Request resolved: https://github.com/pytorch/benchmark/pull/1924
Reviewed By: xuzhao9
Differential Revision: D49584103
Pulled By: msaroufim
fbshipit-source-id: 7488e368692404397fd9b227f78092ce35c0edcc