[deepseek_v4] Add DeepseekV4NextNPredictor class for MTP draft-head support #46127
Add DeepseekV4NextNPredictor class for MTP draft-head support
e229b31d
Use layer_type=sliding_attention for MTP block (not last main-layer's…
9f565da0
Assignees
No one assigned
Login to write a write a comment.
Login via GitHub