transformers
badc71b9 - 🔴[`Attention`] Attention refactor for Whisper-based models (#38235)

Commit

202 days ago

🔴[`Attention`] Attention refactor for Whisper-based models (#38235)

* start refactoring whisper
* revert for now
* first step
* carry over attn fixes
* check if this works
* whisper has an off by one somewhere - cutting mask in any interface
* make it based on interface
* remove some tests that were skipped but now work
* some fixes for whisper tests
* interface changes
* change the order of fix
* some attention adjustments for eager + TP
* fix scaling
* mask changes
* why does whisper contain those extra seq lens?
* fix from config for fa2 as input_ids is invalid
* fix another test
* another fix
* disable flex attn due to compile issues
* copies and refactor for qwen audio since it somewhat relies on whisper
* fix scaling and smaller things
* retrigger
* new new interface version + more fixups
* adjust qwen
* add comment
* forgot this one
* change copies as whisper cuts on the mask
* add guard
* add flex attention
* switch to new mask function + add skips for torchscript
* remove old api with cache position
* last changes?
* trigger ci

References

run_amd_scheduled_ci_caller

#38235 - 🔴[`Attention`] Attention refactor for Whisper-based models

Author

vasqu

Parents

565a0052
