Runpod, 15mo ago
8 replies
VidimusWolf

Keeping Flashboot active?

It is my understanding that Flashboot is only active for "a while" after each request, and is then disabled as the instance goes into a deeper sleep. Sadly, in my case a cold start after a long idle period adds a whopping 70-90 seconds of pure delay (running llama-2-13b-chat-hf on the 48GB GPUs, e.g. A40). I don't know if I am doing something wrong there, as others on this forum seem to be getting much faster start times. On consecutive jobs, however, the delay drops to 1-3 seconds. How frequently do requests need to arrive to keep Flashboot functional? I assume the exact window is some "secret", but would e.g. 1 job every 10 minutes do the trick?
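To make the "1 job every 10 minutes" idea concrete, here is a minimal sketch of a keep-warm pinger. It rests on assumptions: that the endpoint is reachable through the serverless `runsync` route with a Bearer API key, that the worker accepts a `{"prompt": ...}` input (hypothetical, it depends on the handler), and that 600 seconds is merely my guess from above, not a known Flashboot window. Note that every ping is a billed job.

```python
import json
import time
import urllib.request

API_BASE = "https://api.runpod.ai/v2"  # assumed base URL of the serverless API
PING_INTERVAL_S = 600  # the 10-minute guess from the question, not a documented value


def build_ping(endpoint_id: str, api_key: str) -> urllib.request.Request:
    """Build a tiny POST job for the endpoint. Nothing is sent here."""
    # Hypothetical input schema; the real one depends on the worker's handler.
    body = json.dumps({"input": {"prompt": "ping"}}).encode("utf-8")
    return urllib.request.Request(
        f"{API_BASE}/{endpoint_id}/runsync",
        data=body,
        headers={
            "Authorization": f"Bearer {api_key}",
            "Content-Type": "application/json",
        },
        method="POST",
    )


def keep_warm(endpoint_id: str, api_key: str, interval_s: int = PING_INTERVAL_S) -> None:
    """Send one small job per interval, forever, printing the HTTP status."""
    while True:
        request = build_ping(endpoint_id, api_key)
        with urllib.request.urlopen(request, timeout=120) as response:
            print("ping status:", response.status)
        time.sleep(interval_s)
```

Calling `keep_warm("<endpoint id>", "<api key>")` from a small always-on machine would run the loop. Whether this interval is short enough to keep Flashboot active is exactly what I am asking.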