DeepSpeed
Add DataStates-LLM: Asynchronous Checkpointing Engine Support
#7166
Open

mauryaavinash95 wants to merge 6 commits into deepspeedai:master from DataStates:dev
mauryaavinash95 requested a review from tjruwase 189 days ago
mauryaavinash95 requested a review from tohtana 189 days ago
mauryaavinash95 requested a review from jomayeri 189 days ago
mauryaavinash95 requested a review from loadams 189 days ago
mauryaavinash95 requested a review from GuanhuaWang 189 days ago
mauryaavinash95 requested a review from hwchen2017 189 days ago
mauryaavinash95 changed the title from "Add DataStates-LLM: Asynchronous Checkpointing Engine Support #5763" to "Add DataStates-LLM: Asynchronous Checkpointing Engine Support" 189 days ago
tjruwase commented on 2025-03-25
tjruwase commented on 2025-03-25
mauryaavinash95 force pushed from 968f6ca5 to 1c701d7c 182 days ago
mauryaavinash95 requested a review from tjruwase 182 days ago
tjruwase commented on 2025-03-29
mauryaavinash95 requested a review from tjruwase 178 days ago
tjruwase commented on 2025-04-02
tjruwase commented on 2025-04-02
tjruwase commented on 2025-04-02
tjruwase commented on 2025-04-02
tjruwase approved these changes on 2025-04-02
mauryaavinash95 force pushed from 11dd8437 to 3a820715 164 days ago
mauryaavinash95 force pushed from 3a820715 to 84f067b6 164 days ago
Add datastates-llm to runtime/checkpoint_engine/readme
27de5424
Fix JSON format in readme for datastates-llm
d9df5807
mauryaavinash95 force pushed from 84f067b6 to 6160140e 164 days ago
Fix formatting issues for DataStates-LLM
12e65a6b
Add preserves_storage_sharing for checkpoint engines
59788f83
Update to Apache-2.0 License, move debloating to checkpointing engine
b1312d17
Fix whitespaces
4651ec29
mauryaavinash95 force pushed from 6160140e to 4651ec29 164 days ago
mauryaavinash95 requested a review from tjruwase 164 days ago
