transformers
Fix attention mask in mamba layers
#41790
Merged

Fix attention mask in mamba layers #41790

zucchini-nlp
HuggingFaceDocBuilderDev
vasqu
vasqu commented on 2025-10-22
zucchini-nlp
zucchini-nlp commented on 2025-10-22
zucchini-nlp not all mamba models are like LFM
3462502d
zucchini-nlp zucchini-nlp force pushed from efc2ee0f to 3462502d 203 days ago
zucchini-nlp Merge branch 'main' into fix-mamba-pad-masking
a57ffa63
zucchini-nlp
github-actions
zucchini-nlp
zucchini-nlp compile friendly
f07dbf49
zucchini-nlp
zucchini-nlp
zucchini-nlp commented on 2025-10-22
github-actions
vasqu
vasqu commented on 2025-10-22
zucchini-nlp adjust slow tests expectation
4f760ec0
zucchini-nlp naming
a8b35a02
github-actions
vasqu
vasqu approved these changes on 2025-10-22
zucchini-nlp zucchini-nlp merged 87be5595 into main 203 days ago

Login to write a write a comment.

Login via GitHub

Reviewers
Assignees
No one assigned
Labels
Milestone