Some queries take longer than usual

I've noticed that some queries take a very long time (they get stuck in delay). Why? P.S. I've noticed that the problem occurs when I leave the server idle for a while.
6 Replies
Jason, 3w ago
Maybe it's a cold start: the model needs to be loaded in the worker.
tanawatl (OP), 2w ago
Can I upload a checkpoint to RunPod storage to use with serverless?
Jason, 2w ago
Yes, sure. It's just like a pod: you need code that downloads the checkpoint from inside the serverless worker and puts it into /runpod-volume, which is the network volume (storage).
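A minimal sketch of that download step, assuming the checkpoint is reachable at a plain URL and the network volume is mounted at /runpod-volume as described above. The helper name `ensure_checkpoint` and its parameters are hypothetical, not part of any RunPod API. It skips the download when the file is already on the volume, so only the first worker pays for it.

```python
import os
import shutil
import urllib.request
from typing import Optional

VOLUME_DIR = "/runpod-volume"  # mount path mentioned in the reply above


def ensure_checkpoint(url: str, dest_dir: str = VOLUME_DIR,
                      filename: Optional[str] = None) -> str:
    """Download `url` into `dest_dir` unless it is already there; return the path."""
    name = filename or os.path.basename(url.rstrip("/"))
    os.makedirs(dest_dir, exist_ok=True)
    dest = os.path.join(dest_dir, name)
    if os.path.exists(dest):
        return dest  # already on the volume, nothing to do

    # Write to a temporary name first so an interrupted download
    # never leaves a truncated file under the final name.
    tmp = dest + ".part"
    with urllib.request.urlopen(url) as resp, open(tmp, "wb") as out:
        shutil.copyfileobj(resp, out)
    os.replace(tmp, dest)
    return dest
```

Calling something like `ensure_checkpoint(checkpoint_url)` before the model is loaded would leave the file on the volume for later workers. The URL here is a placeholder you would supply yourself.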
tanawatl (OP), 2w ago
So I need to modify the code of the vLLM worker in RunPod's Git repository?
