cog
fd95417d - fix: remove hardcoded 300s setup timeout, add COG_SETUP_TIMEOUT env var (#2826)

Commit
10 days ago
fix: remove hardcoded 300s setup timeout, add COG_SETUP_TIMEOUT env var (#2826) The coglet orchestrator had a hardcoded 300-second internal setup timeout that would kill model setup() before the external orchestrator (director) timeout could fire. This caused flux-dev models to fail at 5 minutes even when director was configured with a 10-minute timeout. Changes: - Change setup_timeout from Duration to Option<Duration>, default None - No internal timeout by default — director is the sole authority - Add COG_SETUP_TIMEOUT env var (seconds) for opt-in internal timeout - Warn on invalid or zero values instead of silently ignoring - Add integration tests for slow setup (happy path) and explicit timeout - Improve test harness: wheel auto-detection, negation support, clean task
Author
Parents
Loading