Runpod

R

Runpod

We're a community of enthusiasts, engineers, and enterprises, all sharing insights on AI, Machine Learning and GPUs!

Join

⚡|serverless

⛅|pods

🔧|api-opensource

📡|instant-clusters

🗂|hub

Cannot start fast stable diffusion notebook

It was fine this morning but now im getting error messages when starting the pod such as invalid username and password. When following the link to the LastBen repo on huggingface it seems to no longer exist.
Solution:
Template has been discontinued/abandoned. RunPod should delist it.

how to stop scripts from replacing my configuration

I have a version of ComfyUI together with a version of python and it's dependendencies in a direcotry named /workspace on a storage volume. Everytime this pre_start script runs it wipes out my lib and starts over. I tried modifying the script but it gets regenerated every time I restart the pod. How to stop this?
No description

Trouble with Pod HTTP Service Port

First time user of runpod.io. Went through the steps of creating a Pod, used TheBloke Local LLMs One-Click UI When I click on "Connect" I am not able to connect to the HTTP Service Port. It reads "Not Ready"....
No description

Please help - Connect web to Pod

I am trying to connect my website to the Pod using WebSocket. My website is https and I get errors when trying to connect to a Pod port. Does anyone know how to do something like that? Thanks!...

two pods disappeared .

I was working under company account and both pods seem to have disappeared now screen is showing "you dont have any active pods". These have been running for a while. Company account email got no message about their termination and deletion, they are simply not there anymore.

Can't connect with my POD

I have this problem that I can't connect with my POD. Can you guys please take a look and tell me what is going on?
No description

Can private images on GHCR be fetched using registry credentials?

I'm trying to create a template using a private package on GHCR, but am not sure if I can use registry credentials to do so. Since GitHub requires using personal access tokens, will I just have to manually pull/deploy in a pod, instead of using a RunPod template?
Solution:
You should be able to create a new registry credential with the following format for github packages

How to Estimate the Survival Time of Spot Instances?

I need some advice on estimating the survival time of RunPod Spot instances. I've noticed that sometimes my Spot instances run for several hours without interruption, while other times they get terminated within minutes. This variability makes it challenging to choose between SPOT and ON-DEMAND.

run a function in a pod

Suppose I had a function to do some computation, and I wanted to run that inside a pod - how would I go about doing that entirely from the python sdk?

Impossible to launch a CPU Pods via API

when I try to launch a CPu pods via APi with it's id it just crash, with the graphql api it say : Pod resumed: { errors: [ {...

pod network down

My pod's network went down a while ago and still isn't back - k3c9sctuperq0u is the ID. Obviously I can't get logs or anything. Is there any way I can see when it might be fixed?

Pod crashing due to low regular RAM?

Hey, I am running ComfyUI and my pod keeps crashing at one point in the workflow, the VRam is only at 70% utilised, but the GPU says 100% Does this mean if I found a different pod with more regular Ram, then I could keep going with the workflow?...
No description

where is the stop icon??

i would like to pause my pod, but i can only terminate it??

wasted all my credits trying to figure out how to actually initialize the GPUs in the pod instamces

I tried everything I can think of. installed all the nvidia drivers--everything I would do normally. Could not get any GPU to show as a device. I tried multiple preconfigured pods that said all ready to go but nothing seemed to work properly.

Multi Node training with torchrun/slurm

Has anyone here ever tried multinode on runpod? I am thinking of setting this up but if people have encountered prohibitive network speeds I do not see a reason to.

How to get Public IP and set symmetrical port mapping on Pod via Python SDK

I have created a pod with python in the following way ```python runpod.api_key = os.getenv("RUNPOD_API_KEY") bot_name = 'Testing Pod Public IP 1'...
No description

🆘 We've encountered a serious issue with the machines running in our production environment

🆘 We've encountered a serious issue with the machines running in our production environment on RunPod: the GPU utilization fluctuates wildly, sometimes even dropping to zero, which significantly slows down task execution. Who should I contact?
No description

REST API with Ollama

Hello everyone, I installed ollama and trying to make some request do this API using my pod instance and port and I´m getting no results or 502. I´m using this tutorial: https://docs.runpod.io/tutorials/pods/run-ollama...

Can't create pod via graphQL endpoint but works manually

I'm trying to create a new pod using a given template and networkvolume. I can do this using the website just fine however when I try to duplicate the exact same settings using the podRentInterruptable graphQL mutation I'm getting a There are no longer any instances available with the request specifications. Please try again later. error. Here is the mutation: ...