Optimization: Qwen3 next autoregressive pass #17996
CISC
approved these changes
on 2025-12-13
It's Qwen3 Next, the lean mean token generation machine!
b739b11d
pwilkin
force pushed
from
4a494ab7
to
b739b11d
4 days ago
Apply patches from thread
bd7b7105
Remove recurrent version, only keep chunked and autoregressive
9357f6df
Remove unnecessary conts and asserts
a58f2aca
Remove more extra conts and asserts
2c44d215
Cleanup masking
b1477de0
ggerganov
approved these changes
on 2025-12-16
pwilkin
merged
a5251ca1
into master 3 days ago
Assignees
No one assigned
Login to write a write a comment.
Login via GitHub