Delay time of 120,000 ms?

Runpod is advertising <250ms cold start times. I am running a custom ASR model that isnt more than a couple of gigabytes. Total docker image is 11gb.

For some reason, the delay time is inifinite and the request never goes through. Any ideas?
Was this page helpful?