transformers
b16688e9 - General weight initialization scheme (#39579)

Commit
276 days ago
General weight initialization scheme (#39579) * general + modulars from llama * all modular models * style and fix musicgen * fix * Update configuration_musicgen.py * Update modeling_utils.py
Author
Parents
Loading