Runpod · 3w ago

Potential Issues When Scaling a Serverless Endpoint

Hello, I want some information about scaling a SaaS project on a Runpod serverless endpoint.
I am thinking of hiring a developer to build me a SaaS project that uses a Kokoro FastAPI Docker container.
In the endpoint configuration I want to set a 16 GB GPU as the first choice and a 24 GB GPU as the second.
I don't want to go above these, to keep the cost per generated audio down.
Each user will generate between 8 and 10 minutes of audio.

My only concern is being able to scale to hundreds, if not thousands, of simultaneous users as the use case grows.
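To make that concern concrete, here is a minimal back-of-envelope sketch of how many busy workers the endpoint might need. The real-time factor (`rtf`, GPU seconds per second of audio) and the `headroom` multiplier are assumptions, not figures from the post; you would benchmark the actual Kokoro container on the 16 GB and 24 GB GPUs to get real numbers.

```python
import math

def estimate_workers(requests_per_min: float,
                     audio_minutes: float = 10.0,
                     rtf: float = 0.1,
                     headroom: float = 1.5) -> int:
    """Rough steady-state worker count for a TTS endpoint.

    rtf: hypothetical real-time factor -- GPU minutes spent per
    minute of audio produced (benchmark your own container).
    headroom: extra capacity so bursts don't queue badly.
    """
    gpu_min_per_job = audio_minutes * rtf
    # Little's law: average jobs in flight = arrival rate x service time.
    avg_in_flight = requests_per_min * gpu_min_per_job
    return max(1, math.ceil(avg_in_flight * headroom))
```

For example, 60 requests per minute of 10-minute audio at an assumed 0.1 RTF works out to about 60 jobs in flight, so roughly 90 workers with 1.5x headroom; a trickle of one request every two minutes still needs at least one warm worker.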

Bottom line: I want to offer this service to websites that will connect via an API or a custom code snippet.
If their users are impacted, that in turn impacts the service I want to provide.
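For context, the client-side integration could look something like the sketch below. The `/run` URL shape and Bearer header follow Runpod's serverless API; the `"text"` input field is whatever the Kokoro container's handler expects and is an assumption here, as is the helper name.

```python
import json

RUNPOD_API_BASE = "https://api.runpod.ai/v2"

def build_run_request(endpoint_id: str, api_key: str, text: str):
    """Build (url, headers, body) for an async /run job submission.

    Hypothetical helper: a partner website would POST this with any
    HTTP client, then poll /status/{job_id} for the finished audio.
    """
    url = f"{RUNPOD_API_BASE}/{endpoint_id}/run"
    headers = {
        "Authorization": f"Bearer {api_key}",
        "Content-Type": "application/json",
    }
    # The handler-specific input schema ("text") is an assumption.
    body = json.dumps({"input": {"text": text}})
    return url, headers, body
```

Using the async `/run` route rather than `/runsync` matters at this audio length: 8-10 minute generations can outlast a synchronous HTTP timeout, so polling for the result is the safer pattern.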

Realistically, can Runpod achieve this, or will we most likely run into issues where users are impacted?

I know I can add more funds for more workers and more GPUs, so would that be sufficient, or would this require an enterprise-grade solution?