Add TDT loss kernel #46048

ebezzam wants to merge 82 commits into huggingface:main from ebezzam:tdt_loss_kernel
ebezzam
parakeet tdt intergration
fa7d6e0e
lmaksym Add TDT decoder support for Parakeet ASR models
f2b49380
ebezzam Add expected outputs for TDT, small fixes.
fa36657f
ebezzam Separate CTC and TDT generate outputs.
05e2e346
ebezzam Work with auto device, better init,
bb5ff331
ebezzam Test timestamps and expose token duration.
9ec79b02
ebezzam Add reproducer link.
33f128ec
lmaksym fix: align TDT training and decoding with NeMo implementation
760b4b61
lmaksym revert: restore lasr generated files to original state
b33002fc
lmaksym warn: torchaudio rnnt_loss does not train duration head
48b39dd1
ebezzam Relax timestamp test, and test nits.
e9f23ab6
lmaksym feat: TDT training
e2b97aa1
lmaksym chore: for cuda detection and run without patching
6b9fc731
ebezzam Equivalent timestamp processing as Nemo, and various nits/cleanup.
6c879bc0
ebezzam Merge branch 'parakeet-tdt' of github.com:lmaksym/transformers into p…
149e17f4
ebezzam Simplify durations config.
36bfa639
ebezzam Update training examples.
2df0ccca
lmaksym chore: enable parralelism
388c6d36
lmaksym chore: performance optimization
08b2b558
lmaksym fix: formatting
0c4e05a8
ebezzam Doc and testing nits
1ddd8049
ebezzam Use active mask from current step, and nits.
f5126703
ebezzam Better pre-allocate.
07d8e35e
ebezzam TDT has separate pad token and blank token.
fab050a3
ebezzam Merge branch 'main' into parakeet-tdt
c438565c
ebezzam Regenerate lasr.
86d980c1
ebezzam Merge branch 'parakeet-tdt' of github.com:lmaksym/transformers into p…
895c4a0a
ebezzam Style checks and nits
ab21380b
ebezzam Nits, put back ctc loss test
d0141d5f
ebezzam More standard model output.
f7529d41
ebezzam Style
77b95d73
ebezzam Remove compute_loss flag and allow monkey patching to tdt loss
94eae66f
ebezzam Update src/transformers/models/parakeet/modular_parakeet.py
f7d40675
ebezzam Address various comments.
f75c17b6
ebezzam More compatible with Transformers forward/generate approach
5a49b651
ebezzam compile option for generation and decoder cache
881233fd
ebezzam Cleaner, better conventions.
b41a8ee6
ebezzam Merge branch 'main' into parakeet-tdt
897753a0
ebezzam Update with main.
6c914dbe
ebezzam doc nits
756cee1e
ebezzam Imitate whisper for encoder outputs as input
f30c5364
ebezzam Address tests and nits.
fa95fc8e
ebezzam Inherit from GenerateMixIn for get_compiled_call
5df7f289
ebezzam Comment nit
cd706d48
eustlb forward cleanup
a47ed8a5
eustlb generate cleanup + separate generation file
13b68cec
eustlb generate: add _supported_generation_modes
72c1ad00
eustlb automatic init of the loss
8e23b3df
eustlb modular cleanups
1cc39fd8
eustlb use is_encoder_decoder
531f297e
eustlb timestamp processing fully from tokens + durations
2c0f23af
eustlb convertion script update
cef6639e
eustlb test update
fd3cf9b2
eustlb make
e63a5bf1
eustlb Merge branch 'main' into parakeet-tdt
f9d1a4fc
eustlb test update
43ee7cd7
eustlb test update
c2a0f781
eustlb ensure correct loss computation
1fd7ed78
eustlb kernel loss
7cc9d2e7
eustlb test loss integration
e753eab1
eustlb push to hub pr
ed3fa4dc
eustlb integration tests to rely fully on transcripts
ab66b239
eustlb udpate fixtures
a5ba0c61
eustlb we don't need to monkey patch with numba anymore!
48279a67
eustlb fix pipeline usage
1d7680d4
eustlb nit
59ddcedb
eustlb fix usage
31490d19
ebezzam Pass through tests and examples: improve kernel fallback, update with…
d8eb1b6f
ebezzam Update checkpoint
1f1b912d
ebezzam Merge branch 'main' into parakeet-tdt
9ab08d1e
ebezzam Add TDT to mapping after merge.
fd9f8b1b
ebezzam Fix lasr generate test.
136f6768
ebezzam Output attention mask if labels provided for computing loss.
833d2890
ebezzam Apply suggestion from @ArthurZucker
a1c62a1f
ebezzam Improve ParakeetTDTDecoderCache definition and usage.
86835704
ebezzam Remove tuple parsing.
1d4b0f43
ebezzam processor refactor
a418ecae
ebezzam Merge branch 'parakeet-tdt' of github.com:lmaksym/transformers into p…
5d0c6318
ebezzam Update conversion.
5c603c17
ebezzam ebezzam marked this pull request as draft 38 days ago
ebezzam Merge branch 'main' into tdt_loss_kernel
09ba99c4
ebezzam Modular after merge.
e743b2d2
github-actions
ebezzam
ebezzam commented on 2026-05-19
ebezzam Don't allow all kernels.
8d09cb6a
ebezzam
ebezzam commented on 2026-05-19
HuggingFaceDocBuilderDev

Login to write a write a comment.

Login via GitHub

Reviewers
No reviews
Assignees
No one assigned
Labels
Milestone