[fix] sliding window attention mask #38045
fix sliding attn
10df8030
zucchini-nlp
marked this pull request as ready for review 324 days ago
Merge branch 'main' into fix-sliding-attn
3afee994
make style
df740a60
gante
approved these changes
on 2025-05-13
Update tests/test_modeling_common.py
e6ff2763
no a second throught, should default to `True` fo BC
1c351314
Merge branch 'main' into fix-sliding-attn
cb21baed
Assignees
No one assigned
Login to write a write a comment.
Login via GitHub