transformers.js
[WIP] Add NeMo Conformer TDT ASR support
#1571
Open

[WIP] Add NeMo Conformer TDT ASR support #1571

ysdede wants to merge 42 commits into huggingface:main from ysdede:v4-nemo-conformer-tdt-main-r3
ysdede
ysdede feat(nemo-conformer-tdt): port Nemo Conformer TDT model and ASR pipeline
d0176023
ysdede fix(nemo-conformer-tdt): handle empty token decode output
964bc8fa
ysdede chore(nemo-conformer-tdt): keep typegen compatibility for transcribe/…
fa9bc254
ysdede refactor(nemo-conformer-tdt): redesign transcribe output shape and API
63aeee83
ysdede fix(nemo-conformer-tdt): round timestamps and confidences, simplify p…
f6835ad8
ysdede fix: dispose tensors on error path, decouple frame confidences from t…
2dd36a11
ysdede merge: sync with upstream (ModelRegistry API refactor, commit 4811a61)
10977df7
ysdede feat: integrate NemoConformerTDT with ModelRegistry API
39d9be4f
ysdede style: replace console.warn with logger.warn in feature extractor
3d984e52
ysdede fix(nemo-conformer-tdt): harden edge cases, restore pipeline design
9f3a2202
ysdede test(nemo-conformer-tdt): rewrite tests to match current API
3bac1dc0
ysdede fix(nemo-conformer-tdt): harden decoding and feature utilities
c75ebd27
ysdede fix(nemo-conformer-tdt): address critical review issues
493a5881
ysdede fix(nemo-conformer-tdt): clamp timestamps and validate cache limits
5b4cdabd
ysdede fix(nemo-conformer-tdt): close remaining tensor disposal leaks
76902274
ysdede fix(nemo-conformer-tdt): dispose auxiliary decoder outputs
1f065c36
ysdede perf(nemo-conformer-tdt): avoid tolist in length hot path
ec09a090
ysdede fix(nemo-conformer-tdt): harden duration and audio validation
8a90a7c9
ysdede fix: address prioritized review findings
ce0a3eb9
ysdede fix(nemo-conformer-tdt): apply low-risk hardening follow-ups
5d91d396
ysdede fix(nemo-conformer-tdt): enforce named outputs and frame-level confid…
dfc2c130
ysdede docs(nemo-conformer-tdt): clarify cached tensor sharing semantics
a5bd2cf7
ysdede fix(nemo-conformer-tdt): harden decoder I/O validation and feed cleanup
abada622
ysdede fix(nemo-conformer-tdt): address bot review findings
62d8bc0d
ysdede style(nemo-conformer-tdt): simplify duration frame expression
03fb8bd2
ysdede fix(nemo-tdt): address PR10 follow-up review comments
426061e5
ysdede fix(transformers): resolve Nemo TDT typegen regressions
d7476a6d
ysdede fix(nemo-tdt): address PR11 cache and vocab review feedback
0989f7ab
ysdede merge: sync upstream/main through #1559 and keep Nemo TDT ASR wiring
49a4af8f
ysdede fix(nemo-tdt): add supports() for ASR model class selection
ee819a1c
ysdede fix(model-registry): include processor files for text-to-audio pipelines
b44f7f3f
ysdede Revert "fix(model-registry): include processor files for text-to-audi…
bfa97e6e
ysdede fix(nemo-tdt): address PR #12 reviewer feedback
a85dff25
ysdede feat(nemo-tdt): align asr pipeline outputs and long-audio handling
8dfccddc
ysdede chore(tests): drop unrelated parakeet feature extractor coverage
816f5811
ysdede feat(nemo-conformer-tdt): add sentence-based ASR pipeline chunking
f59ba068
ysdede fix(nemo): scope ASR tests and address review fixes
00b3d934
ysdede fix(nemo-tdt): address follow-up review threads
07118c38
ysdede chore(asr): restore upstream cast spacing
341df3d7
ysdede fix(nemo-tdt): handle sparse vocab and merge dedupe
29f2baaf
ysdede fix(nemo-tdt): merge split domain suffix words
39e5cb1d
ysdede Revert "fix(nemo-tdt): merge split domain suffix words"
495bab59
ysdede ysdede marked this pull request as draft 1 day ago

Login to write a write a comment.

Login via GitHub

Reviewers
No reviews
Assignees
No one assigned
Labels
Milestone