transformers
[deepseek_v4] Add DeepseekV4NextNPredictor class for MTP draft-head support
#46127
Open

[deepseek_v4] Add DeepseekV4NextNPredictor class for MTP draft-head support #46127

pasta-paul wants to merge 2 commits into huggingface:main from pasta-paul:dsv4-mtp-class
pasta-paul
pasta-paul Add DeepseekV4NextNPredictor class for MTP draft-head support
e229b31d
pasta-paul
pasta-paul Use layer_type=sliding_attention for MTP block (not last main-layer's…
9f565da0
github-actions
pasta-paul
Rocketknight1
pasta-paul
pasta-paul
pasta-paul
ArthurZucker
pasta-paul

Login to write a write a comment.

Login via GitHub

Reviewers
No reviews
Assignees
No one assigned
Labels
Milestone