huggingface/accelerate

Pull Requests Commits

Add our own parse util

muellerzr committed 1 year ago

9a04b8b5

destroy process group in `end_training` (#3012)

SunMarc committed 1 year ago

Verified 589fddd3

Wrong import check for TE (#3016)

muellerzr committed 1 year ago

Verified 99c69aaf

fix default value for rank size in cpu threads_per_process assignment logic (#3009)

rbrugaro committed 1 year ago

Verified 00785cd9

Enable FSDP & Deepspeed + FP8 (#2983)

muellerzr committed 1 year ago

Verified a452327e

Fix `find_tied_params` for models with shared layers (#2986)

qubvel committed 1 year ago

Verified 851cf343

update version to 0.34.dev0 (#3007)

SunMarc committed 1 year ago

Verified cd5698bb

Add small util to enable FSDP offloading quickly (#3006)

muellerzr committed 1 year ago

Verified 90d50239

Make env variables optional for FSDP (#2998)

muellerzr committed 1 year ago

Verified 3bde6156

Fix deepspeed tests (#3003)

muellerzr committed 1 year ago

Verified dc3b5ad8

clear memory after offload (#2994)

SunMarc committed 1 year ago

Verified 12a5befd

Support skip_first_batches for XLA (#2966)

yitongh committed 1 year ago

Verified 79ca85c2

Fix typo on warning str: "meta device device" -> "meta device" (#2997)

HeAndres committed 1 year ago

Verified 13d93c4f

Explicit check for `step` when loading the state (#2992)

muellerzr committed 1 year ago

Verified d982751a

Fix gated test (#2993)

muellerzr committed 1 year ago

Verified 95edc68c

Fix bug of clip_grad_norm_ for xla fsdp (#2941)

append-only committed 1 year ago

Verified 288accc0

remove .md to allow proper linking (#2977)

nbroad1881 committed 1 year ago

Verified 83b06101

add MLU devices for rng state saving and loading. (#2940)

huismiling committed 1 year ago

Verified 386f7d28

chore: Update runs-on configuration for CI workflows (#2981)

XciD committed 1 year ago

Verified 308a8e96

Enable Unwrapping for Model State Dicts (FSDP) (#2959)

alex-jw-brooks committed 1 year ago

Verified f35cbd1f

Fix torchvision to be compatible with torch version in CI (#2982)

SunMarc committed 1 year ago

Verified a14260c9

Require safetensors>=0.4.3 (#2957)

byi8220 committed 1 year ago

Verified 32f368ec

feat(ci): add `pip` caching in CI (#2952)

SauravMaheshkar committed 1 year ago

Verified 415eddf1

Properly handle Params4bit in set_module_tensor_to_device (#2934)

matthewdouglas committed 1 year ago

Verified 23085769

Add `torch.float8_e4m3fn` format `dtype_byte_size` (#2945)

SunMarc committed 1 year ago

Verified a5a3e571

delete CCL env var setting (#2927)

Liangliang-Ma committed 1 year ago

Verified 0af1d8b8

Improve test reliability for Accelerator.free_memory() (#2935)

matthewdouglas committed 1 year ago

Verified d16d7371

Consider pynvml available when installed through the nvidia-ml-py distribution (#2936)

matthewdouglas committed 1 year ago

Verified 7a5c231b

Fix import test (#2931)

muellerzr committed 1 year ago

Verified 4f02bb76

Hotfix PyTorch Version Installation in CI Workflow for Minimum Version Matrix (#2889)

yhna940 committed 1 year ago

Verified 709fd1e4

Older