transformers
[mistral] Support passing `head_dim` through config (and do not require `head_dim * num_heads == hidden_size`)
#32050
Merged

[mistral] Support passing `head_dim` through config (and do not require `head_dim * num_heads == hidden_size`) #32050

xenova merged 4 commits into main from mistral-head_dim
xenova
xenova Allow `head_dim` to be set in Mistral config
5771d59c
xenova Add docstring
6a963544
xenova Do not require `head_dim * num_heads == hidden_size`
dd3e56f5
xenova xenova requested a review from ArthurZucker ArthurZucker 1 year ago
HuggingFaceDocBuilderDev
ydshieh ydshieh added run-slow
xenova [run-slow] mistral
3f8e6e42
ydshieh ydshieh requested a review from amyeroberts amyeroberts 1 year ago
ArthurZucker
ArthurZucker approved these changes on 2024-07-18
amyeroberts
amyeroberts approved these changes on 2024-07-18
xenova xenova merged 4c040aba into main 1 year ago
xenova xenova deleted the mistral-head_dim branch 1 year ago

Login to write a write a comment.

Login via GitHub

Assignees
No one assigned
Labels
Milestone