DeepSpeed
Update Domino for Llama3
#7084
Open

shenzheyu wants to merge 20 commits into deepspeedai:master from shenzheyu:master
shenzheyu
shenzheyu requested a review from GuanhuaWang 288 days ago
shenzheyu requested a review from hwchen2017 288 days ago
GuanhuaWang
loadams Update setup.py handling of ROCm cupy (#7051)
963f11bd
loadams nv-ds-chat breaks with latest transformers (#7052)
f538f55c
shenzheyu update for llama3
ef6c29b7
shenzheyu fix format
54a14214
tjruwase Rename aio_thread_count to intra_op_parallelism (#7056)
42395260
inkcherry add autoTP training zero2 tests (#7049)
f3ce29fa
wukong1992 Fix, bf16 optimizer remove dup loop (#7054)
5a725723
loadams Update version.txt after 0.16.4 release (#7063)
adb4e084
stas00 fix an outdated doc wrt CUDA_VISIBLE_DEVICES (#7058)
aeaf0ce4
siqi654321 Tecorigin sdaa accelerator (#6903)
ef1cbd08
loadams Handle special case of libuv for Windows (#7064)
9df70c28
loadams Update README with info on newest accelerator (#7065)
1faaf1ea
U-rara Bug Fix for offload_states API (#7050)
fb6d9a87
loadams Fix TOCTOU issues, switch to fstat (#7067)
3638f9cb
ShellyNR config torch to avoid graph breaks caused by logger (#6999)
f0db0104
Yejing-Lai Fix meta load tensor imcompatible issue (#7073)
b9a77e2a
loadams Replace calls to `python setup.py sdist` with `python -m build --sdis…
68309753
loadams Revert "Handle special case of libuv for Windows (#7064)" (#7076)
dddc7cf6
Yejing-Lai Add DeepseekV3 AutoTP. (#7045)
91d05e2b
shenzheyu force pushed from 6d32bb4f to 91d05e2b 281 days ago
shenzheyu requested a review from tjruwase 281 days ago
shenzheyu requested a review from tohtana 281 days ago
shenzheyu requested a review from jomayeri 281 days ago
shenzheyu requested a review from loadams 281 days ago
shenzheyu Merge branch 'master' into master
be199d2d
loadams
GuanhuaWang
hwchen2017 marked this pull request as draft 99 days ago

Assignees
No one assigned