transformers
Fix graph break in torch.compile when using FA2 with attention_mask=None and batch size > 1
#37332
Merged

efsotr Fix graph break in torch.compile when using FA2 with attention_mask=N…
422e3b3b
github-actions marked this pull request as draft 266 days ago
efsotr marked this pull request as ready for review 266 days ago
github-actions requested a review from ArthurZucker 266 days ago
github-actions requested a review from Rocketknight1 266 days ago
efsotr fix code format
69a2b038
ArthurZucker commented on 2025-04-08
efsotr Merge remote-tracking branch 'upstream/main' into fa2_compile_graph_b…
d9424933
efsotr add test; replace position_ids with query_states because position_ids…
1dff2564
efsotr Merge remote-tracking branch 'upstream/main' into fa2_compile_graph_b…
812cb5a5
ArthurZucker approved these changes on 2025-04-08
efsotr Merge branch 'main' into fa2_compile_graph_break
a381b48d
efsotr Merge branch 'main' into fa2_compile_graph_break
b1c101bf
efsotr Merge branch 'main' into fa2_compile_graph_break
7cb8d411
efsotr add assert loss is not nan
5028afc6
efsotr
llllvvuu
efsotr Merge branch 'main' into fa2_compile_graph_break
ba3ad0c3
efsotr
ArthurZucker enabled auto-merge (squash) 187 days ago
ArthurZucker merged 3ee72af6 into main 187 days ago
HuggingFaceDocBuilderDev

Assignees
No one assigned