transformers
Fix InternVL attention when using qk_norm (38B and 78B)
#37620
Merged

Loading