transformers
22b0a898 - Granite speech speedup + model saving bugfix (#39028)

Granite speech speedup + model saving bugfix (#39028)

* Ensure the query is updated during training, to avoid unused parameters that DDP does not like.
* Avoid a crash when `kwargs` contain `padding=True`; trainers often pass this argument automatically.
* Remove the `mel_spec` lazy init and rename it to `mel_filters`. This ensures `save_pretrained` will not crash when saving the processor during training:
  https://github.com/huggingface/transformers/blob/d5d007a1a0f0c11a726a54c8f00bd71825f84d02/src/transformers/feature_extraction_utils.py#L595
* Minor: most feature extractors have a `sampling_rate` property.
* Speed up the relative position embeddings.
* Fix several issues in model saving/loading:
  - avoid modifying `self._hf_peft_config_loaded` when saving;
  - the adapter_config automatically points to the original base model, but a finetuned version should point to the model save dir instead;
  - fix the model weight names, which are changed by adding an adapter.
* Fix a crash when PEFT is not active.
* Add a TODO to replace `einsum`.
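
The first bullet concerns a DistributedDataParallel constraint: every trainable parameter must take part in each forward pass that produces the loss, or DDP raises an error at gradient-sync time (unless `find_unused_parameters=True` is set, which costs throughput). A minimal sketch of the pattern, with a hypothetical Q-Former-style projector; the class and shapes are illustrative, not the Granite code:

```python
import torch
import torch.nn as nn

class ProjectorSketch(nn.Module):
    # Hypothetical stand-in for a projector with a learnable query tensor.
    def __init__(self, num_queries: int = 8, dim: int = 64):
        super().__init__()
        self.query = nn.Parameter(torch.randn(num_queries, dim))
        self.attn = nn.MultiheadAttention(dim, num_heads=4, batch_first=True)

    def forward(self, features: torch.Tensor) -> torch.Tensor:
        # The learnable query must flow through the computation on every
        # training step; if it were detached or replaced by a cached copy,
        # DDP would flag it as an unused parameter during training.
        q = self.query.unsqueeze(0).expand(features.size(0), -1, -1)
        out, _ = self.attn(q, features, features)
        return out
```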
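On the `padding=True` crash: trainer pipelines commonly inject this argument into processor calls, and a callee without a `padding` parameter raises a `TypeError`. A hedged sketch of the defensive pattern (function names are hypothetical, not from the patch):

```python
import numpy as np

def _inner_transform(audio: np.ndarray) -> np.ndarray:
    # Hypothetical inner routine that accepts no `padding` argument.
    return audio * 0.5

def extract_features(audio: np.ndarray, **kwargs) -> np.ndarray:
    # Trainers often pass padding=True automatically; pop it so the inner
    # call, which has no `padding` parameter, does not raise a TypeError.
    kwargs.pop("padding", None)
    return _inner_transform(audio, **kwargs)

features = extract_features(np.zeros(16000), padding=True)  # no crash
```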
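The `mel_filters` rename likely matters because of the linked `feature_extraction_utils.py` code, which excludes a `mel_filters` attribute from the dict serialized by `save_pretrained`; computing the filterbank eagerly in `__init__` also means no lazily attached, non-JSON-serializable object is left in the instance state. A minimal sketch using the real `mel_filter_bank` helper from `transformers.audio_utils`; the class itself is hypothetical:

```python
from transformers.audio_utils import mel_filter_bank

class FeatureExtractorSketch:
    # Hypothetical class: the filterbank is built eagerly at init time
    # under the name `mel_filters`, which the linked serialization code
    # drops before writing the JSON config.
    def __init__(self, feature_size: int = 80, sampling_rate: int = 16000, n_fft: int = 400):
        self.feature_size = feature_size
        self.sampling_rate = sampling_rate
        self.n_fft = n_fft
        self.mel_filters = mel_filter_bank(
            num_frequency_bins=1 + n_fft // 2,
            num_mel_filters=feature_size,
            min_frequency=0.0,
            max_frequency=sampling_rate / 2.0,
            sampling_rate=sampling_rate,
            norm="slaney",
            mel_scale="slaney",
        )
```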
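On the adapter_config fix: PEFT configs carry a `base_model_name_or_path`, which by default keeps pointing at the original checkpoint; when saving a finetuned model, pointing it at the save directory lets the saved adapter resolve against its own weights. A sketch of the idea, not the exact patch:

```python
from peft import LoraConfig

def retarget_adapter_config(peft_config: LoraConfig, save_directory: str) -> LoraConfig:
    # By default the adapter config still references the original base
    # checkpoint; a finetuned save should point to its own directory.
    peft_config.base_model_name_or_path = save_directory
    return peft_config
```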
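The relative position embedding speedup and the `einsum` TODO both touch hot-path attention math, where a plain matmul often beats an equivalent `einsum` call. A small self-contained demonstration of one such equivalence (shapes illustrative, not taken from the Granite code):

```python
import torch

batch, heads, seq, dim = 2, 4, 10, 16
q = torch.randn(batch, heads, seq, dim)
rel_emb = torch.randn(2 * seq - 1, dim)  # relative position embeddings

# einsum formulation ...
scores_einsum = torch.einsum("bhld,md->bhlm", q, rel_emb)
# ... and the equivalent batched matmul, which typically dispatches to
# faster kernels.
scores_matmul = q @ rel_emb.transpose(0, 1)

assert torch.allclose(scores_einsum, scores_matmul, atol=1e-6)
```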