Runpod

We're a community of enthusiasts, engineers, and enterprises, all sharing insights on AI, Machine Learning and GPUs!

Custom serverless deployment

I want to host https://hub.docker.com/r/konieshadow/fooocus-api on a serverless endpoint, but I am unable to find the related documentation. Can someone guide me through this, please?

Automatic A111 WebUI Serverless on Network Volume

As posted before here on the channel, I am using the Automatic A111 WebUI Serverless on Network Volume by Ashley K. The issue is that it is unusable, taking between 1 and 3 minutes to generate a single image. I spent months integrating this API into our app, and now I need to launch to production....

SD Img2Img API does not work with Mask

Hey guys, I am using ashleykleynhans/runpod-worker-a1111. I've tried passing images, PIL objects, links to images, and Base64; the error is the same for all: AttributeError "str" has no ".convert"... ...
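For reference, the AUTOMATIC1111 img2img route (`/sdapi/v1/img2img`) takes `init_images` as a list of base64 strings and `mask` as a single base64 string. The sketch below only builds such a payload from raw bytes. The helper names, the placeholder bytes, and the `denoising_strength` value are hypothetical, and how the worker wraps this payload depends on its own schema, which is not shown here.

```python
import base64
import json


def to_base64(data: bytes) -> str:
    """Encode raw image bytes as an ASCII base64 string."""
    return base64.b64encode(data).decode("ascii")


def build_img2img_payload(image: bytes, mask: bytes, prompt: str) -> dict:
    """Build a JSON-serializable img2img payload (hypothetical helper)."""
    return {
        "prompt": prompt,
        "init_images": [to_base64(image)],  # list of base64 strings
        "mask": to_base64(mask),            # one base64 string, not a PIL object
        "denoising_strength": 0.75,         # illustrative value only
    }


# Placeholder bytes stand in for the contents of real image files.
payload = build_img2img_payload(b"image-bytes", b"mask-bytes", "a red chair")

# Serializing succeeds only if every value is plain JSON (str, list, number).
body = json.dumps(payload)
```

If `json.dumps` raises here, something other than a plain string (for example a PIL object) has ended up in the payload.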

unsupported model error

In this repo, https://github.com/runpod-workers/worker-stable_diffusion_v1, the model ID emilianJR/CyberRealistic_V3 works, but misri/cyberrealistic_v41BackToBasics does not. I am new to diffusers, please help me...

Logs are missing.

I'm trying to get the error logs for a prediction, but all I see is the model loading/downloading output from the Docker build. How can I get the logs for inference to debug?

error pulling image: Error response from daemon: Get "https://registry-1.docker.io/v2/"

2024-02-14T04:23:33Z create container stevenynie/dreamweaver:20240213192815
2024-02-14T04:23:41Z pending image pull stevenynie/dreamweaver:20240213192815
2024-02-14T04:23:47Z create container stevenynie/dreamweaver:20240213192815
2024-02-14T04:23:47Z pending image pull stevenynie/dreamweaver:20240213192815
2024-02-14T04:24:02Z create container stevenynie/dreamweaver:20240213192815...

Is there a way to access worker ID & job ID from a handler? Would be good for logging + debugging

If Runpod could drop worker ID and job ID into environment variables made available to handlers that would be ace. Would save me a lot of time manually mapping things when trying to debug issues.
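A minimal sketch of what such logging could look like inside a handler. The job dictionary passed to a handler carries an `id` key alongside `input`. Reading the worker ID from the `RUNPOD_POD_ID` environment variable is an assumption here, so the code falls back to a placeholder when it is not set.

```python
import logging
import os

logging.basicConfig(level=logging.INFO)
log = logging.getLogger("worker")


def handler(job: dict) -> dict:
    """Tag each log line with the job and worker identifiers."""
    job_id = job.get("id", "unknown-job")
    # Assumption: the platform sets RUNPOD_POD_ID; fall back if it does not.
    worker_id = os.environ.get("RUNPOD_POD_ID", "unknown-worker")
    log.info("job=%s worker=%s started", job_id, worker_id)
    return {"job_id": job_id, "worker_id": worker_id}


# Local call with a hand-made job dictionary, no platform required.
result = handler({"id": "example-job", "input": {"prompt": "hello"}})
```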

Serverless errors in the logs

I am having a lot of these errors in our logs lately. In our app we have nothing that indicates a problem in our backend. We have a lot of good requests going through though. Any idea what this could be related to?...

Issue in pod

I've been facing an issue for the last 2 days: sometimes the RTX 4090 generates 60 tokens/second and sometimes 20 to 30 tokens/second for the same response. I don't know what is behind this...

ashleykleynhans/runpod-worker-a1111 img2img not working with a mask?

I'm using a modified version of ashleykleynhans/runpod-worker-a1111. The original version has the mask field commented out in the img2img schema file:
```py
# Setting this as an empty string by default causes the API call to fail
# 'mask': {...
```

max workers set to 2 but endpoint page shows ‘5 idle’

Is this the way it should be?
Solution:
Yes, we cache a couple extra for you. You will never have more than the max workers running however.

[FEATURE REQUEST] Granular selection for Serverless Pod GPUs

Hi team, not sure if this is the correct place to post. Feature request: I'd like to select specific GPU pods for my server; for example, the grouped tier with L4, A5000, and 3090 could be split into individual options. Why? TensorRT models now take advantage of RTX GPUs, so being able to select the 3090 and 4090, or different groups, would be highly beneficial for keeping Docker images that work with a specific architecture....

Serverless - 404 cannot return results

I'm getting the following error:
{"requestId": "sync-af9a620e-1480-4502-9287-640b30cfcdff-e1", "message": "Failed to return job results. | 404, message='Not Found', url=URL('https://api.runpod.ai/v2/mm8w337d46kypj/job-done/hevyjx14k6tl6p?gpu=NVIDIA+RTX+A4500&isStream=false')", "level": "ERROR"}
{"requestId": "sync-af9a620e-1480-4502-9287-640b30cfcdff-e1", "message": "Finished.", "level": "INFO"}
{"requestId": "sync-af9a620e-1480-4502-9287-640b30cfcdff-e1", "message": "Failed to return job results. | 404, message='Not Found', url=URL('https://api.runpod.ai/v2/mm8w337d46kypj/job-done/hevyjx14k6tl6p?gpu=NVIDIA+RTX+A4500&isStream=false')", "level": "ERROR"}
{"requestId": "sync-af9a620e-1480-4502-9287-640b30cfcdff-e1", "message": "Finished.", "level": "INFO"}
...
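Since these worker logs are one JSON object per line, a short script can group the errors by request when there are many of them. This is a sketch under that assumption. The sample lines are made up and only mirror the `requestId`, `message`, and `level` keys seen above; `errors_by_request` is a hypothetical helper.

```python
import json
from collections import Counter

# Made-up sample lines that mirror the keys in the log excerpt.
SAMPLE = """
{"requestId": "sync-example-1", "message": "Failed to return job results.", "level": "ERROR"}
{"requestId": "sync-example-1", "message": "Finished.", "level": "INFO"}
{"requestId": "sync-example-2", "message": "Finished.", "level": "INFO"}
...
"""


def errors_by_request(text: str) -> Counter:
    """Count ERROR-level records per requestId, skipping non-JSON lines."""
    counts = Counter()
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue
        try:
            record = json.loads(line)
        except json.JSONDecodeError:
            continue  # e.g. a truncation marker such as "..."
        if isinstance(record, dict) and record.get("level") == "ERROR":
            counts[record.get("requestId", "unknown")] += 1
    return counts


counts = errors_by_request(SAMPLE)
```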

Debugging Failed Serverless Requests

How do I debug failed serverless requests? I have an Automatic1111 serverless endpoint and everything is working well. However, on Saturday between 17:00 and 21:00, requests started to fail, and then everything returned to normal again. Is there any way to get more information on what happened, or to get the logs for this period?...
Solution:
Found that the logs tab gives the option to filter logs using a datetime range 👍

webhooks custom updates

Does the job webhook get invoked with runpod.serverless.progress_update calls?

Error generating images

Everything was working fine but now we're getting this error - { "delayTime": 2992,...
Solution:
Do you download the model from Hugging Face? If so, their page is down.

In serverless GPU, is delay time also charged or not?

Solution:
Yeah, so there is delay time, which consists of: sitting in the queue + cold start (the machine starting up + maybe some setup code you might have outside the handler in handler.py, such as model = load(modelname) or whatever) + imports of libraries ...
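To illustrate the cold-start part of that breakdown: code at module level runs once when the worker process starts, while the handler body runs for every job. A minimal sketch, where `load_model` is a hypothetical stand-in for an expensive load and the job dictionaries are made up:

```python
import time

LOAD_CALLS = 0


def load_model(name: str) -> dict:
    """Hypothetical stand-in for an expensive model load."""
    global LOAD_CALLS
    LOAD_CALLS += 1
    time.sleep(0.01)  # pretend this takes a while
    return {"name": name}


# Module level: runs once per worker start, i.e. as part of the cold start.
MODEL = load_model("example-model")


def handler(job: dict) -> dict:
    """Runs once per job and reuses the already loaded model."""
    prompt = job["input"]["prompt"]
    return {"model": MODEL["name"], "echo": prompt}


# Three jobs handled by the same process trigger only one model load.
outputs = [
    handler({"id": str(i), "input": {"prompt": f"p{i}"}}) for i in range(3)
]
```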

sdxl

I'm confused: is a Runpod serverless endpoint charged for the whole month, or based on usage?
Solution:
Based on usage, i.e. how long workers are running.

Unit for Pricing

I never pay attention to this on the internet: what is the unit here? second or minute?
Solution:
No description