transformers
cd478c35 - load the pretrained weights for encoder-decoder

Commit

6 years ago

load the pretrained weights for encoder-decoder We currently save the pretrained_weights of the encoder and decoder in two separate directories `encoder` and `decoder`. However, for the `from_pretrained` function to operate with automodels we need to specify the type of model in the path to the weights. The path to the encoder/decoder weights is handled by the `PreTrainedEncoderDecoder` class in the `save_pretrained` function. Sice there is no easy way to infer the type of model that was initialized for the encoder and decoder we add a parameter `model_type` to the function. This is not an ideal solution as it is error prone, and the model type should be carried by the Model classes somehow. This is a temporary fix that should be changed before merging.

Author

rlouf

Committer

rlouf

Parents

aa24121e

transformers cd478c35 - load the pretrained weights for encoder-decoder

transformers
cd478c35 - load the pretrained weights for encoder-decoder