transformers
c63a3d0f - Fix: Mamba2 `norm_before_gate` usage (#32686)

Commit
1 year ago
Fix: Mamba2 `norm_before_gate` usage (#32686) * mamba2 uses norm_before_gate=False * small nit * remove norm_before_gate flag and follow False path only
Author
Parents
Loading