pod not running
why does this happen so much?
it seems to waste credits for startup and then it idles thinking it's running, but nothing can connect.
6 Replies

like, shouldn't this kind of thing be detected by middleware?
Looking at the logs for this Pod, this error can happen on misconfigured Community Cloud hosts. Generally, the platform assumes that this sort of error doesn't happen. Docker tries to start the pod for about 200ms at a time, sees an error, then repeats.

I'll flag this machine with the host who runs it, and I've delisted it in the meantime.
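
To illustrate the kind of detection asked about above, here is a minimal sketch of a restart-loop check. It is not how RunPod or Docker actually handle this; the function name `is_crash_looping`, its parameters, and the thresholds are all hypothetical. It assumes you can get a list of container start timestamps (in seconds) from somewhere, and it flags the pod when too many start attempts land inside a short time window.

```python
from typing import Sequence


def is_crash_looping(
    start_times: Sequence[float],
    window_s: float = 5.0,   # hypothetical window length, in seconds
    max_starts: int = 10,    # hypothetical limit on starts per window
) -> bool:
    """Return True if more than max_starts start attempts fall
    within any span of window_s seconds."""
    times = sorted(start_times)
    left = 0
    for right, t in enumerate(times):
        # Shrink the window from the left until it spans at most window_s.
        while t - times[left] > window_s:
            left += 1
        # Count of start attempts currently inside the window.
        if right - left + 1 > max_starts:
            return True
    return False


# Assumed example data: one start attempt every 0.2s, as in a tight retry loop.
looping = [i * 0.2 for i in range(30)]
# Assumed example data: a pod that was started a few times, minutes apart.
healthy = [0.0, 60.0, 120.0]

print(is_crash_looping(looping))
print(is_crash_looping(healthy))
```

A check like this could stop billing or mark the machine as unhealthy once it fires, instead of letting the pod sit in a "running" state that nothing can connect to.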
it should do a bit more than that for community hosts, since it's RunPod that's doing the billing. in my opinion, only so much of the blame can go to the end hosts
The error described is a pretty basic config error. You're entirely right; at the same time, we'd expect the machines to be properly configured in the first place.
I'll make sure we build something for it in our project to identify errors with GPUs.