[mistral] Support passing `head_dim` through config (and do not require `head_dim * num_heads == hidden_size`) #32050
Allow `head_dim` to be set in Mistral config
5771d59c
Add docstring
6a963544
Do not require `head_dim * num_heads == hidden_size`
dd3e56f5
[run-slow] mistral
3f8e6e42
xenova
merged
4c040aba
into main 1 year ago
xenova
deleted the mistral-head_dim branch 1 year ago
Assignees
No one assigned
Login to write a write a comment.
Login via GitHub