Generate: Fix modern llm `generate` calls with `synced_gpus` #34095
sync gpus
7d4c4813
sync gpus
0ee8da43
fix other decoding methods
287911fc
nit
53d8e10f
fix assisted gen (consistent return api)
262c9719
gante
force pushed
to
262c9719
1 year ago
gante
commented
on 2024-10-11
gante
merged
37ea0401
into main 1 year ago
gante
deleted the prepare_sync_gpus branch 1 year ago
Assignees
No one assigned
Login to write a write a comment.
Login via GitHub