transformers
6daa3eeb - Fix InternVL attention when using qk_norm (38B and 78B) (#37620)

Commit

352 days ago

Fix InternVL attention when using qk_norm (38B and 78B) (#37620) * fix internvlvision attention when using qk_norm * nit * modular

References

Author

yonigozlan

Parents