Why too long delay time even if I have active worker ?
I have set the active worker to 1. I am manually testing the response delay. I submit the next task only after the previous task is completed, so there is no waiting time. However, many times, the delay time is still very long, sometimes even reaching more than 4 seconds. Why is this? In my code, the model has been loaded before runpod.serverless.start({"handler": run})
Recent Announcements
Continue the conversation
Join the Discord to ask follow-up questions and connect with the community
R
Runpod
We're a community of enthusiasts, engineers, and enterprises, all sharing insights on AI, Machine Learning and GPUs!