transformers
Fix InternVL attention when using qk_norm (38B and 78B)
#37620
Merged

Fix InternVL attention when using qk_norm (38B and 78B) #37620

yonigozlan
yonigozlan fix internvlvision attention when using qk_norm
d3b2bbc0
github-actions github-actions marked this pull request as draft 354 days ago
github-actions
yonigozlan yonigozlan requested a review from Cyrilvallez Cyrilvallez 354 days ago
yonigozlan yonigozlan marked this pull request as ready for review 354 days ago
yonigozlan Merge branch 'main' into fix-large-internvl
f1450b4c
HuggingFaceDocBuilderDev
Cyrilvallez
Cyrilvallez approved these changes on 2025-04-19
yonigozlan Merge remote-tracking branch 'upstream/main' into fix-large-internvl
38111e04
yonigozlan nit
8d49d932
yonigozlan modular
dcf558c0
yonigozlan yonigozlan merged 6daa3eeb into main 353 days ago

Login to write a write a comment.

Login via GitHub

Reviewers
Assignees
No one assigned
Labels
Milestone