transformers
6d369124 - Align torch implementation of Gated DeltaNet in Qwen3-Next with fla library. (#40807)

Commit
254 days ago
Align torch implementation of Gated DeltaNet in Qwen3-Next with fla library. (#40807) * align torch implementation of gdn with fla. * fix fla import. * fix * remove unused attr * fixes --------- Co-authored-by: bozheng-hit <dsoul0621@gmail.com> Co-authored-by: Cyril Vallez <cyril.vallez@gmail.com>
Author
Parents
Loading