model: add Mellum architecture (#23966)
* model: support for Mellum architecture
* model: improve mellum.py formatting
* model: improve mellum.py formatting once again
* deps: downgrade transformers to 4.57.6 (to fix CI)
* deps: remove huggingface_hub dependency
* deps: remove huggingface_hub from test requirements
---------
Co-authored-by: Sigbjørn Skjæret <sigbjorn.skjaeret@scala.com>