R
Runpod2w ago
dpk

Stuck initializing vLLM

I'm using the official runpod vLLM image for a serverless endpoint, using all default settings (besides the model), and my workers are all stuck initializing. I have a job in the queue (just hitting /v1/models) for 20+ minutes now and there are five workers (3 regular + 2 extra) just spinning "initializing". I don't see anything in the worker logs. What am I doing wrong, and more to the point, how do I figure out what I'm doing wrong? Just seeing logs would be nice. (Endpoint id: 99ky9cmdcjj2hm)
15 Replies
mentiro
mentiro2w ago
I'm having this problem now - did you figure out any solutions?
Hugo
Hugo7d ago
same issue here. been stuck for 14 hours now
No description
Poddy
Poddy7d ago
@dpk
Escalated To Zendesk
The thread has been escalated to Zendesk!
dpk
dpkOP7d ago
I gave up. I've had this issue several times in the past, I assume it's just the way serverless is
Unknown User
Unknown User6d ago
Message Not Public
Sign In & Join Server To View
woody0538
woody05386d ago
@Jason This issue still persists today. I can't launch a serverless endpoint using vllm with default settings.
Unknown User
Unknown User6d ago
Message Not Public
Sign In & Join Server To View
woody0538
woody05386d ago
Yes, I just started one about 10 minutes ago using a docker image and still initializing
No description
Hugo
Hugo6d ago
doesnt roll out without being stuck initializing for minimum 12 hours for me
No description
Unknown User
Unknown User6d ago
Message Not Public
Sign In & Join Server To View
Hugo
Hugo6d ago
cant find them it was 12+ hours ago. some of the workers: b3b4sx1nxx51wh / aurqsfa3kndbu8
Unknown_User
Unknown_User5d ago
that link does not work btw
Unknown User
Unknown User5d ago
Message Not Public
Sign In & Join Server To View
mentiro
mentiro2d ago
I'm still having this problem. I had heard that you were going to be releasing an update to possibly fix this yesterday. I'm sure the AWS outage overshadowed that, but any update on fixing this issue?
Unknown User
Unknown User21h ago
Message Not Public
Sign In & Join Server To View

Did you find this page helpful?