RunPod

R

RunPod

We're a community of enthusiasts, engineers, and enterprises, all sharing insights on AI, Machine Learning and GPUs!

Join

⚡|serverless

⛅|pods-clusters

Stable diffusion checkpoint list empty with Better Forge template

Following the instructions here: https://blog.runpod.io/introducing-better-forge-spin-up-new-stable-diffusion-pods-quicker-than-before/ which were written just last month... I have downloaded two different checkpoints from civit into the stable-diffusion-webui-forge/models/Stable-diffusion folder. However the dropdown list of checkpoints in the webui is empty. I have tried clicking the refresh button, refreshing the page, and restarting the pod, but no matter what I do the models will not show up. What is going on?...

Can't select 2x GPU for my old pod, while I could start a new pod with the same GPU setup

Might be a stupid question, but I had a pod running yesterday with 2 H100 PCIe and I can start my pod only with 0 or 1 GPU, which looks like an availability issue. But If I want to deploy a new pod, I can choose 2 H100 PCIe and the availability is medium.

Udp ports

Hi, currently trying to run a webrtc based streaming application on runpod. The application would connect to a client using webrtc udp connection. Is this something possible to do with runpod?...

Cannot set TCP-Port 3000 for Dreambooth

Hi all, i want to use Dreambooth, and i tryed to set the Port via Fuser and in the Configuration of the pod, but its notpossible to open the port. Fuser throws permission denied (fuser -k 3000/tcp). Any suggestions for me, please?? 😦

A40 availability

There are a couple of >1 month old posts about this but it seems to be an issue again, A40s have become pretty much entirely unavailable other than at weird times (~7am GMT) and it's been like this for about a week now, what's going on? Availability seems unusually poor, I've never known it like this, I've got quite a lot of credit that I can't use.

SGLANG load LLM Model

I am trying to load LLM model using Pods by using sglang template. Here is my config: When I start the pod, it did not loading the model instead the container log keep showing cuda things(license,version). May I know what is the reason?...
No description

Unable to start pod using GraphQL

I am trying to create a pod using the GraphQL endpoint but I am getting 400 status response, here are the request and response for the same. Please let me know how to get this working. ``` Sending GraphQL query: mutation {...

Differentiating between the pod state, "starting" vs "stopping"

When I start a pod and fetch it's details through the grapQL api, the "runtime" is None but when I stop it, the "runtime" is None as well. Is there a way to differentiate between these two states ?

Building and deploying dockerfile from Pod

Has anyone figured out how to properly build and push a dockerfile in a runpod pod? https://docs.runpod.io/tutorials/pods/build-docker-images I'm trying to do this for my custom serverless worker bc my personal pc has some docker issues that i havent figured out over multiple days of debugging (i think wsl got corrupted somehow but that's a different issue). But every time i try to run bazel run //:push_custom_image, it shows the following error: ``` WARNING: Target pattern parsing failed....

Still waiting for logs but I can Console in?

My container is still in Waiting For Logs state, but I can access it through the web console, run services, and access them through http. Even after doing this, it is still showing waiting for logs. The docker entrypoint script does not seem to have completed as the services it should run are not started. What logs does the container logs on runpod ui actually look in? Is it what the container prints to the screen or a log file?...

price

from which point of the provision of the pod in getting credited?

Assistance with Deploying AI App on RunPod

I recently purchased RunPod to deploy my AI app, but I could use some guidance on implementing my end-to-end project. I have a project folder, "X," that includes my custom models (in both ONNX and PyTorch formats) and Flask APIs. It’s working well locally, but I'm a bit confused about transitioning to RunPod. Specifically, I’m unsure about how to best leverage Pods, serverless options, and templates to set it up on your platform. I've explored the documentation but still have questions on structuring and deploying it effectively. Could you or someone from your team provide guidance or resources to help me set up and run my project on RunPod?...

cant ssh to runpod

-- RUNPOD.IO -- Enjoy your Pod #1r3czjoca6n3zh ^_^ Error response from daemon: Container b7b51b7b9a1b7e03f346032b3339de15d9465317e632972e2b5be6b3584d8759 is not running...

How fast are network volumes?

Hey, this kind of belongs into here, and also into serverless. For a client we're currently architecting some stuff, and the question we're having is, just exactly how fast are network volumes. My limited benchmarks make it feel like they add about a minute for an SDXL Model to the execution time, because the Model needs to be loaded to RAM. This seems to be much faster with local storage. What are your experiences, any pieces of advice? Any gotchas? Thank you so much 🙂 ...

Please help me.

I have to deploy backend that built using flask on VPS with GPU. The backend performs object detection using YOLO. How to do it on RunPod? And what is this error?...

very slow network storage

I deploy pod with network storage on US-KS-2 and its extremally slow (storage disk)
pyton -m venv venv
pyton -m venv venv
...

Pod not starting up properly anymore

When I deploy a pod with the "RunPod Stable Diffusion" template on demand, its not starting up properly, even if I wait for an hour. I can not launch jupyterlab or the sd webui. Did something change with the platform? This used to be a very easy and straightforward process withouth issues....

Putty for SSH? Any clues?

I'm trying to connect with putty. I think I converted the ed25519 to .ppk correctly. I know how to use it to authenticate. I can connect and get prompted to "login as:", so I just use "root" because I can't find a username in documentation. Which username do we use? Am I doing this right?...

Unable to Connect AWS to RunPod

I am encountering an issue while attempting to connect AWS to RunPod. Despite multiple attempts, the connection fails, and we have been unable to establish a successful link between the two services. Any guidance or troubleshooting steps to fix this issue would be greatly appreciated. Thank you in advance for your support!