Gaudi3 CI #38790

IlyasMoutawwakil merged 73 commits into main from gaudi-ci
IlyasMoutawwakil
IlyasMoutawwakil first pass
9f107d37
IlyasMoutawwakil force pushed from f4787997 to 9f107d37 1 year ago
HuggingFaceDocBuilderDev
ydshieh
ydshieh commented on 2025-06-12
IlyasMoutawwakil trigger ci on dummy channel and dummy repo id
4cf64ab1
ydshieh
ydshieh
IlyasMoutawwakil add a model jobs workflow and reduce model splits
f10b1960
IlyasMoutawwakil add runner scale set
7815fc4a
IlyasMoutawwakil add docker image
7896ef36
IlyasMoutawwakil test with model jobs only
8dcbe6b8
IlyasMoutawwakil remove mounting
584e697e
IlyasMoutawwakil from model jobs as well
25e19127
IlyasMoutawwakil test again
67c566f6
ydshieh check
3e79fdc0
IlyasMoutawwakil use runner groups
2d3cd447
IlyasMoutawwakil checkout transformers code
1d2e747f
IlyasMoutawwakil remove clean up
ab7bd005
IlyasMoutawwakil use curr dir
920e1bf0
IlyasMoutawwakil test again
50b280b5
IlyasMoutawwakil test single workflow
1a1e70aa
IlyasMoutawwakil test
faa3d032
IlyasMoutawwakil test without the ids
d6dfa081
IlyasMoutawwakil fix
be06f2ec
IlyasMoutawwakil again
7b5ba3a6
IlyasMoutawwakil test
f9de3144
IlyasMoutawwakil fix
7174fe01
ydshieh
IlyasMoutawwakil fix
4c39c9c4
IlyasMoutawwakil matrix folders
544222c5
IlyasMoutawwakil two splits
75347394
IlyasMoutawwakil fix
98d53b69
IlyasMoutawwakil test
ee94b556
IlyasMoutawwakil fix
1b45fde5
IlyasMoutawwakil run other tests
7bbde0ee
IlyasMoutawwakil fix dep
e55636d3
IlyasMoutawwakil use canonical job names
79caf899
IlyasMoutawwakil fix and non lazy mode
cd30bc39
IlyasMoutawwakil add fsdp tests and disable model ci entirely for now
003cdce6
IlyasMoutawwakil add librosa and soundfile
27102d59
IlyasMoutawwakil use model jobs for fsdp tests
d881b66f
IlyasMoutawwakil test model jobs as well
039a9b8c
IlyasMoutawwakil fix
d0d0fb12
IlyasMoutawwakil quant matrix
0308c71c
IlyasMoutawwakil fix
249b0780
IlyasMoutawwakil remove omp num threads
0ff48353
IlyasMoutawwakil force pushed from 04a02fb7 to 0ff48353 362 days ago
IlyasMoutawwakil default to hpu_backend if not passed to torch.compile
cff1bb66
IlyasMoutawwakil fix
7bc0e12e
IlyasMoutawwakil enable int64
401ed188
IlyasMoutawwakil test mounting the cache
f1f843d2
IlyasMoutawwakil remove parallelism flags
8847e35a
IlyasMoutawwakil fix device dispatch
a97732b5
IlyasMoutawwakil fix sdpa atol/rtol for hpu
edd35f14
IlyasMoutawwakil
IlyasMoutawwakil commented on 2025-06-17
IlyasMoutawwakil force hpu_backend all the time
8b6d0a02
IlyasMoutawwakil add run_first decorator and disable fsdp2
7bc8e378
IlyasMoutawwakil add deepspeed run_first decorators
d9c2cb70
IlyasMoutawwakil fix multiprocessing on habana
38b35958
IlyasMoutawwakil fix more distributed tests that require running first
23b34de0
IlyasMoutawwakil Merge branch 'main' into gaudi-ci
95b9ac6e
IlyasMoutawwakil fix machine types
bf706f6b
IlyasMoutawwakil Merge branch 'main' into gaudi-ci
b3ec8e8c
IlyasMoutawwakil skip parallelism tests
db46c855
IlyasMoutawwakil use new slack channel and report repo
ab951b1e
IlyasMoutawwakil add cap_sys
a8ec639b
IlyasMoutawwakil fix bug in test_trainer_distributed
e385d118
IlyasMoutawwakil push
4189a566
IlyasMoutawwakil force pushed from 69a62c5b to 4189a566 359 days ago
IlyasMoutawwakil remove forced test splits
4e97c343
IlyasMoutawwakil marked this pull request as ready for review 358 days ago
IlyasMoutawwakil Merge branch 'main' into gaudi-ci
4cdb35b9
IlyasMoutawwakil requested a review from regisss 358 days ago
IlyasMoutawwakil
IlyasMoutawwakil commented on 2025-06-19
IlyasMoutawwakil
IlyasMoutawwakil commented on 2025-06-19
IlyasMoutawwakil
IlyasMoutawwakil commented on 2025-06-19
IlyasMoutawwakil
IlyasMoutawwakil commented on 2025-06-19
IlyasMoutawwakil requested a review from ydshieh 358 days ago
ydshieh
ydshieh approved these changes on 2025-06-19
IlyasMoutawwakil added comment for hpu_backend_compile patch
9301161e
IlyasMoutawwakil added comment for squad_convert_examples_to_features patch
60bbe09f
IlyasMoutawwakil test
30a69ede
IlyasMoutawwakil force pushed from e8f223ec to 30a69ede 358 days ago
IlyasMoutawwakil remove require_torch_gpu from fsdpv2 tests
eab20c99
IlyasMoutawwakil update synapse ai version
e9c4cee0
regisss
regisss commented on 2025-06-19
IlyasMoutawwakil fix fp8
a0c10767
IlyasMoutawwakil run all models
fb986b27
IlyasMoutawwakil add parallelism
6e3f5994
ydshieh
ydshieh
IlyasMoutawwakil style
0eb7a80b
regisss
regisss approved these changes on 2025-06-20
IlyasMoutawwakil Merge branch 'main' into gaudi-ci
d7478dcd
IlyasMoutawwakil requested a review from ydshieh 354 days ago
IlyasMoutawwakil
IlyasMoutawwakil commented on 2025-06-23
IlyasMoutawwakil Apply suggestions from code review
9d867f16
IlyasMoutawwakil removed review request from ydshieh 354 days ago
IlyasMoutawwakil merged 984ff89e into main 354 days ago
IlyasMoutawwakil deleted the gaudi-ci branch 354 days ago

