transformers
b84ebb7f - fix(qwen3_moe): pass kwargs to self_attn (#38691)

Commit
185 days ago
fix(qwen3_moe): pass kwargs to self_attn (#38691)

This is needed to avoid `.item()` calls in `_flash_attention_forward`.
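
The pattern this commit message describes can be sketched roughly as follows. This is a toy illustration only, not the actual diff: `ToyAttention`, `LayerBefore`, `LayerAfter`, and the `max_length_q` keyword are hypothetical stand-ins, and the fallback branch merely imitates the kind of recomputation that, in the real code path, could require a `.item()` call. It assumes the caller already has the value and passes it as a keyword argument.

```python
# Toy stand-ins (hypothetical), not the real transformers classes.

class ToyAttention:
    def forward(self, hidden_states, max_length_q=None, **kwargs):
        if max_length_q is None:
            # Fallback: derive the value from the data. In this sketch the
            # branch stands in for a recomputation that might need a host
            # sync such as `.item()` (an assumption for illustration).
            max_length_q = max(len(seq) for seq in hidden_states)
            used_fallback = True
        else:
            used_fallback = False
        return max_length_q, used_fallback


class LayerBefore:
    """Accepts **kwargs but drops them before calling self_attn."""

    def __init__(self):
        self.self_attn = ToyAttention()

    def forward(self, hidden_states, **kwargs):
        return self.self_attn.forward(hidden_states)


class LayerAfter:
    """Forwards **kwargs to self_attn, so precomputed values are reused."""

    def __init__(self):
        self.self_attn = ToyAttention()

    def forward(self, hidden_states, **kwargs):
        return self.self_attn.forward(hidden_states, **kwargs)


batch = [[1, 2, 3], [4, 5]]
before = LayerBefore().forward(batch, max_length_q=3)
after = LayerAfter().forward(batch, max_length_q=3)
```

In this sketch both layers return the same length, but only `LayerBefore` takes the fallback branch, because the precomputed keyword never reaches the attention module.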