transformers
6daa3eeb - Fix InternVL attention when using qk_norm (38B and 78B) (#37620)

Commit
352 days ago
Fix InternVL attention when using qk_norm (38B and 78B) (#37620) * fix internvlvision attention when using qk_norm * nit * modular
Author
Parents
Loading