sjt80
RunPod
Created by BAS014 on 2/8/2025 in #⚡|serverless
Workers keep respawning and requests queue indefinitely
Hi @BAS014, were they able to help you? I am having the same problem loading a decent-sized model onto 4 GPUs. I have tried extending executionTimeout to 30 minutes on both the request and the serverless endpoint configuration, but my worker ignores it. It currently 'gives up' on the worker just before 10 minutes each time. It's so frustrating, as the logs show the model either partially loaded into memory or completely loaded, but it moves on to the next worker right before it finishes the job!
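For reference, here is a rough sketch of what the per-request setting would look like. It assumes the request body accepts a `policy` object whose `executionTimeout` is given in milliseconds, so double-check that against the current RunPod docs. The helper name `build_run_payload` and the example input are made up for illustration, and nothing here actually calls the API.

```python
import json


def build_run_payload(model_input: dict, timeout_minutes: int) -> dict:
    """Build a request body with a per-request execution timeout.

    Assumption: the timeout is expressed in milliseconds under
    policy.executionTimeout. Verify against the current docs.
    """
    timeout_ms = timeout_minutes * 60 * 1000  # minutes -> milliseconds
    return {
        "input": model_input,
        "policy": {"executionTimeout": timeout_ms},
    }


# Hypothetical input; 30 minutes is the value I was trying to set.
payload = build_run_payload({"prompt": "hello"}, timeout_minutes=30)
print(json.dumps(payload, indent=2))
```

If the unit really is milliseconds, a value entered as seconds or minutes would end up far shorter than intended, so that is one thing worth ruling out.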
28 replies