Whenever my endpoint receives new requests and it autoscales to create new pods, a few of the pods get stuck while booting and don't respond. Also, while this happens I am being charged because somehow that is considered as uptime, certainly not a fault with my code and multiple other pods work fine on boot
Continue the conversation
Join the Discord to ask follow-up questions and connect with the community
R
Runpod
We're a community of enthusiasts, engineers, and enterprises, all sharing insights on AI, Machine Learning and GPUs!