huggingface/accelerate

Fix batch_sampler maybe None error (#3025)

candlewill committed 1 year ago

Verified 2d4f1dda

Fix fp8 benchmark on single GPU (#3032)

muellerzr committed 1 year ago

Verified c0cf860d

Add early support for `torchdata.stateful_dataloader.StatefulDataLoader` within the `Accelerator` (#2895)

byi8220 committed 1 year ago

Verified ad3f574a

Improve config handling and add a zoo (#3029)

muellerzr committed 1 year ago

Verified 1a6af0bd

Add end_training/destroy_pg to everything and unpin numpy (#3030)

muellerzr committed 1 year ago

Verified 52fae096

Fix torch version check (#3024)

muellerzr committed 1 year ago

Verified 7ffe7662

Set correct NPU backend and distributed_type when using transfer_to_npu (#3021)

ArthurinRUC committed 1 year ago

Verified 5536a3a8

Tweak defaults for quantized-typed FP8 TE weights (#3018)

muellerzr committed 1 year ago

Verified 7ec8eab9

destroy process group in `end_training` (#3012)

SunMarc committed 1 year ago

Verified 589fddd3

Wrong import check for TE (#3016)

muellerzr committed 1 year ago

Verified 99c69aaf

fix default value for rank size in cpu threads_per_process assignment logic (#3009)

rbrugaro committed 1 year ago

Verified 00785cd9

Enable FSDP & Deepspeed + FP8 (#2983)

muellerzr committed 1 year ago

Verified a452327e

Fix `find_tied_params` for models with shared layers (#2986)

qubvel committed 1 year ago

Verified 851cf343

update version to 0.34.dev0 (#3007)

SunMarc committed 1 year ago

Verified cd5698bb

Add small util to enable FSDP offloading quickly (#3006)

muellerzr committed 1 year ago

Verified 90d50239

Make env variables optional for FSDP (#2998)

muellerzr committed 1 year ago

Verified 3bde6156

Fix deepspeed tests (#3003)

muellerzr committed 1 year ago

Verified dc3b5ad8

clear memory after offload (#2994)

SunMarc committed 1 year ago

Verified 12a5befd

Support skip_first_batches for XLA (#2966)

yitongh committed 1 year ago

Verified 79ca85c2

Fix typo on warning str: "meta device device" -> "meta device" (#2997)

HeAndres committed 1 year ago

Verified 13d93c4f

Explicit check for `step` when loading the state (#2992)

muellerzr committed 1 year ago

Verified d982751a

Fix gated test (#2993)

muellerzr committed 1 year ago

Verified 95edc68c

Fix bug of clip_grad_norm_ for xla fsdp (#2941)

append-only committed 1 year ago

Verified 288accc0

remove .md to allow proper linking (#2977)

nbroad1881 committed 1 year ago

Verified 83b06101

add MLU devices for rng state saving and loading. (#2940)

huismiling committed 1 year ago

Verified 386f7d28

chore: Update runs-on configuration for CI workflows (#2981)

XciD committed 1 year ago

Verified 308a8e96

Enable Unwrapping for Model State Dicts (FSDP) (#2959)

alex-jw-brooks committed 1 year ago

Verified f35cbd1f

Fix torchvision to be compatible with torch version in CI (#2982)

SunMarc committed 1 year ago

Verified a14260c9

Require safetensors>=0.4.3 (#2957)

byi8220 committed 1 year ago

Verified 32f368ec

feat(ci): add `pip` caching in CI (#2952)

SauravMaheshkar committed 1 year ago

Verified 415eddf1
