DeepSpeed
fix gemma4 num attention head bugs
#7975
Closed


mingxiang1006 wants to merge 14 commits into deepspeedai:master from mingxiang1006:master
mingxiang1006
fix gemma4 num attention head bugs
62ac4c6b
mingxiang1006 requested a review from tjruwase 55 days ago
mingxiang1006 requested a review from tohtana 55 days ago
chatgpt-codex-connector
delock
revise logic of fallback for text config
96d503df
mingxiang1006
delock
sfc-gh-truwase requested a review from delock 50 days ago
delock
mingxiang1006 Merge branch 'master' into master
6a5f998d
fix gemma4 num attention head bugs
c7aa332c
revise logic of fallback for text config
1f2af84d
Cursx fix(fp_quantizer): fix UB and negative shift warnings in fp_quantize_…
d7c1ca6c
mingxiang1006
mingxiang1006 commented on 2026-04-21
Cursx fix(op_builder): avoid duplicate/wrong -gencode flags (#7974)
c604c942
Flamefire Rename dequantization template parameters (#7976)
175869f1
tohtana Avoid CUDA reinit error in CI tests (#7977)
0ea3c5e9
Merge branch 'master' of https://github.com/mingxiang1006/DeepSpeed
fc2e3429
delock
delock
delock commented on 2026-04-22
remove fix here comment
e385c29a
delock
update text fallback
e5b921f8
delock
delock Merge branch 'master' into master
dd34cd99
mingxiang1006
DCO remediation sign-off
22b31c0e
delock
mingxiang1006
delock
delock
mingxiang1006
delock
delock closed this 20 days ago
