DeepSpeed
Add EXAONE 4.0 model support for Inference V2
#7853
Merged

Bias92
Bias92 requested a review from hwchen2017 38 days ago
Bias92 requested a review from tohtana 38 days ago
Bias92 force pushed from bd52e9d0 to 400d05a3 38 days ago
chatgpt-codex-connector commented on 2026-02-14
tohtana commented on 2026-02-15
Bias92
PKUWZP requested a review from PKUWZP 36 days ago
Bias92 Add EXAONE 4.0 model support for Inference V2
2c312690
Bias92 Fix QK-norm to use local head counts for TP compatibility
81f96e2b
Bias92 Apply yapf formatting
66a7344a
Bias92 Use n_heads_q_local and n_heads_kv_local for GQA compatibility
8ece3c17
Bias92 force pushed from fced31a9 to 8ece3c17 36 days ago
Bias92
tohtana approved these changes on 2026-02-17
tohtana enabled auto-merge (squash) 35 days ago
tohtana Merge branch 'master' into add-exaone4-inference-v2
14dc60b3
tohtana merged f3a9819c into master 35 days ago