lighteval
Autoscaling inference endpoints
#412
Merged

clefourrier merged 15 commits into main from clem_inference_endpoint_autoscale
clefourrier first draft of autoscale
19ab5159
clefourrier adding better management for restarts and resizes
b4c98948
albertvillanova commented on 2024-12-03
clefourrier Merge branch 'main' into clem_inference_endpoint_autoscale
56111ca9
clefourrier upgraded autoscale
e69c321b
clefourrier should be working now!
18a88418
clefourrier added pause option
b59f8973
clefourrier changed the title from "First draft of autoscale" to "Autoscaling inference endpoints" 1 year ago
clefourrier requested a review from NathanHB 1 year ago
NathanHB commented on 2024-12-03
NathanHB commented on 2024-12-03
albertvillanova commented on 2024-12-04
clefourrier restore endpoint name vs model name diff
cb6ea93e
clefourrier debug
3ea93e9d
clefourrier requested a review from NathanHB 1 year ago
clefourrier Merge branch 'main' into clem_inference_endpoint_autoscale
72a978f1
clefourrier added example
b39131f5
clefourrier Merge branch 'main' into clem_inference_endpoint_autoscale
aa0e9e5e
clefourrier fix to parallelism manager - no need for endpoint
99607333
clefourrier fix default batch size override
8b061043
clefourrier Merge branch 'main' into clem_inference_endpoint_autoscale
1193875a
NathanHB commented on 2024-12-04
NathanHB approved these changes on 2024-12-04
clefourrier commented on 2024-12-04
clefourrier Update examples/model_configs/endpoint_model_lite.yaml
2f61f1b1
clefourrier merged b68d5bc9 into main 1 year ago
