DeepSpeed
[Draft] Add On-Policy Distillation (OPSD) Trainer in DeepSpeed
#8027
Open
Go
Login via GitHub
Home
Pricing
FAQ
Install
Login
via GitHub
Overview
Commits
4
Changes
View On
GitHub
[Draft] Add On-Policy Distillation (OPSD) Trainer in DeepSpeed
#8027
PKUWZP
wants to merge 4 commits into
deepspeedai:master
from
PKUWZP:zhipwang_opd_pr
Add OPSD example: config, divergence losses, utils + tests
932f0b52
Add OPSD frozen teacher with CPU logit cache + tests
14d8fe7e
Add OPSD trainer, hybrid-engine rollout, and end-to-end entry point
837787a0
Add OPSD vLLM rollout scaffold, Qwen2/Qwen3 weight bridges, and README
6384396b
PKUWZP
requested a review
from
tohtana
9 days ago
PKUWZP
changed the title
Add On-Policy Distillation (OPSD) example app
[Draft] Add On-Policy Distillation (OPSD) Trainer in DeepSpeed
9 days ago
chatgpt-codex-connector
commented on 2026-05-26
Login to write a write a comment.
Login via GitHub
Reviewers
chatgpt-codex-connector
tohtana
Assignees
No one assigned
Labels
None yet
Milestone
No milestone
Login to write a write a comment.
Login via GitHub