Best tips for lowering SDXL text2image API startup latency?
I'm currently using https://github.com/ashleykleynhans/runpod-worker-a1111 along with a network volume. I only use a single model with the SD text2image endpoint, and I don't need the UI. Right now I'm seeing an 80+ second cold-start delay on the first request. Any suggestions on how to optimize this (without keeping one constantly active worker)? Thanks in advance!
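For context, here's roughly how I'm timing the cold start. This is just a minimal sketch: the endpoint ID, API key, and txt2img payload shape are placeholders, so adjust them to whatever your worker's handler actually expects.

```python
# Rough timing of a cold start against a RunPod serverless endpoint.
# ENDPOINT_ID, API_KEY, and the txt2img payload below are placeholders.
import time
import requests

ENDPOINT_ID = "your-endpoint-id"    # placeholder
API_KEY = "your-runpod-api-key"     # placeholder
URL = f"https://api.runpod.ai/v2/{ENDPOINT_ID}/runsync"

payload = {
    "input": {
        "prompt": "a photo of a cat",  # minimal SDXL txt2img request (example shape)
        "width": 1024,
        "height": 1024,
        "num_inference_steps": 20,
    }
}

start = time.time()
resp = requests.post(
    URL,
    json=payload,
    headers={"Authorization": f"Bearer {API_KEY}"},
    timeout=300,
)
resp.raise_for_status()
body = resp.json()

# RunPod reports queue delay and execution time (in ms) alongside the output,
# which helps separate "worker spin-up" from "actual inference" time.
print(f"wall clock: {time.time() - start:.1f}s")
print("delayTime(ms):", body.get("delayTime"), "executionTime(ms):", body.get("executionTime"))
```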
Or is it better to run the ComfyUI API? Is that one faster to boot?
Unknown User•13mo ago
(message not public)
Ok, maybe not then! 😅
Unknown User•13mo ago
(message not public)
Do you know how long it takes for it to become cold?
Unknown User•13mo ago
(message not public)
Alright, thanks!
I'm experimenting with both as well. Some say the long latency may also be caused by the network volume. I haven't tried packaging models into the Docker image myself, since I need a lot of models and LoRAs and I'm not sure it's worth baking them all in.
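If I do end up baking models in, the plan would be to pre-download the checkpoints at build time with a small script like the sketch below, invoked from the Dockerfile (e.g. `RUN pip install huggingface_hub && python download_models.py`). The repo id, filename, and target directory here are just examples; point them at whatever your worker actually loads.

```python
# download_models.py -- run during `docker build` so the SDXL checkpoint
# lives inside the image instead of on the network volume.
# Repo id, filename, and target directory are examples, not fixed values.
from huggingface_hub import hf_hub_download

hf_hub_download(
    repo_id="stabilityai/stable-diffusion-xl-base-1.0",
    filename="sd_xl_base_1.0.safetensors",
    # a1111's model directory inside the container (example path)
    local_dir="/stable-diffusion-webui/models/Stable-diffusion",
)
```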
Let me know if you find out anything useful 🙂 For now I'm keeping the endpoint warm by sending a request every 2 minutes.
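In case it helps, my keep-warm loop is basically this sketch. The endpoint ID and API key are placeholders, and I keep each ping cheap with a 1-step low-res render.

```python
# Keep-warm loop: fire a tiny job every 2 minutes so the worker never idles out.
# ENDPOINT_ID and API_KEY are placeholders; the payload shape is an example.
import time
import requests

ENDPOINT_ID = "your-endpoint-id"   # placeholder
API_KEY = "your-runpod-api-key"    # placeholder
URL = f"https://api.runpod.ai/v2/{ENDPOINT_ID}/run"  # async, fire-and-forget

while True:
    try:
        requests.post(
            URL,
            json={"input": {"prompt": "warmup", "width": 512, "height": 512,
                            "num_inference_steps": 1}},  # cheapest possible render
            headers={"Authorization": f"Bearer {API_KEY}"},
            timeout=30,
        )
    except requests.RequestException as exc:
        print("ping failed:", exc)
    time.sleep(120)  # every 2 minutes, inside the idle-timeout window
```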