Allow for model kwargs when loading transformers from pretrained (#754)
## Pull Request Overview
This PR introduces support for passing custom keyword arguments when loading pretrained transformer models, enabling more flexible configuration of model loading. It also replaces the fixed "generation_size" parameter with a more general "model_loading_kwargs" field.
- Removed the fixed generation_size parameter.
- Added a new model_loading_kwargs field to the configuration.
- Updated the auto model creation to copy the provided kwargs.
Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>
* suggestion from copilot
---------
Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>