RunPod

R

RunPod

We're a community of enthusiasts, engineers, and enterprises, all sharing insights on AI, Machine Learning and GPUs!

Join

⚡|serverless

⛅|pods-clusters

Passwordless SSH doesn’t work half the time.

I’m using pods in the secure cloud. Half the time, I can’t SSH in and it asks for a password. My key is in authorized files, all the settings for the ssh server are right, but it won’t accept my key. Debug logging gives no reason why. The template is a standard PyTorch 2.2 template from RunPod. The only thing I can do is set a root password and allow using it for SSH and enter my password every time, which is very annoying. Happens all the time and then every now and then it doesn’t and I can SSH in fine without a password. Nothing different on my end. Same template, same scripts doing the login. ...

Flux in Runpod Stable Diffusion WebUI Forge doesn't work in Runpod, although it seems to be possible

I've seen your tutorial to run Flux in Runpod in your blog, but it doesn't work for me, got many errors that I can't solve, I'm not a programmer, sorry 😦 I would like to install Flux in Forge. Why it doesn't work in the version is running in Runpod, is it going to be possible...

vllm seems not use GPU

i'm using vllm and on the graph, when i launch some request, only cpu usage increase. if i open a terminal and launch nvidia-smi, i didn't see any process too. settings line...
No description

Updated a1111 and now i cant connect to the webui port

used git checkout master and git pull in the terminal to update and now i cant connect the port. im getting a 502 | README | Runpod. I already tried deleting the venv and waiting 30 min; no luck. using the official runpod a1111 template

Pod resume failed: This machine does not have the resources to deploy your pod.

Hello! I'm getting this error: Pod resume failed: This machine does not have the resources to deploy your pod. Please try a different machine My pod is a RTX 3090 , 10GB container disk and 60gb volume disk. How can I prevent this from happening?...

Help! My Port 3000 (A1111 web-ui) isn't starting up.

i'm using the ashleykza/a1111 template. It's been working fine till today when I uploaded some new LoRAs.

Can't update custom nodes ComfyUI

New install , update comfy , try to update comfy manager , nothing happens , What am I doing wrong ?
No description

pod with custom template have no tcp ports exposed

Hi, I just created my custom template, and I set the ports to be exposed in the template, but after I deploy a pod, it has no ports exposed, did I configure something wrong?...
No description

IS disk slow

IS(1 ,I think) disk speed is going at 658 MBps while others like US-OR are going at +4000 MBps.
No description

Community runpod template error (Comfyui ashleykza)

I'm trying to deploy this community runpod template Comfyui Ashleykza, but I'm getting this error. How can I proceed?
Solution:
so runpod/comfyui? Cannot find that. one But I found aitrepreneur/comfyui:2.3.5. Testing it now....
No description

Syncing taking too long?

Hi everyone. I'm using ULTIMATE Stable Diffusion Kohya ComfyUI InvokeAI pods. It works well yesterday, but when I tried to create it again today, it stuck on the sync of A1111 (image attached). I've wait a while for this to go through but no dice. I did this in the secure cloud. However when I tried using community cloud, the syncing went fine. Anyone knows what's happening?...
No description

How to store Model to Network Volume

I am saving my Huggingface model with save_pretrained. Which base path do I pass here so that model is saved to Network Volume instead of Container Disk...
Solution:
It is set in the Template. The default mounts to /workspace Often the best way to accomplish storing models there is to create a symbolic link into /workspace...

Account Drained overnight with nothing running

Spun up a serverless api. Did not use it at all. Got billed 60$ since last night. Could you check what caused this behavior. EnerpriseDna Team...
No description

Unable to start pod with llm-foundry image

I'm trying to launch a pod with llm-foundry https://github.com/mosaicml/llm-foundry/tree/main?tab=readme-ov-file#mosaicml-docker-images but the Pod stuck in initialization without error messages.

How to Run Roop unleashed on Runpod

Hello dears I want to run Roop unleashed on Runpod Can you explain the way please...

Pod unreachable

I cannot connect to pod due to timeout. I am using secure cloud. Doesn't seem very reliable. Somebody experiencing the same? ``` ValueError: Ollama call failed with status code 524. Details: <!DOCTYPE html> <!--[if lt IE 7]> <html class="no-js ie6 oldie" lang="en-US"> <![endif]-->...

Is there a way to transfer disk volume between instances? not through 3rd party cloud.

as title, I got a bill for $300 on GCP for egress 3TB of data here, crying :(((((((

A1111 Stable Diffusion 1.10.0 - problems with Dynamic Promts

Hi, I have problems with Dynamic Prompts, the installation works via Automatic1111 but it does not appear in the GUI. same if I use git: cd /workspace/stable-diffusion-webui git clone https://github.com/adieyal/sd-dynamic-prompts/ extensions/sd-dynamic-prompts...
Solution:
Edit the file:
/workspace/stable-diffusion-webui/webui-user.sh
/workspace/stable-diffusion-webui/webui-user.sh
and remove --skip-install if you want to install extensions from the UI....

URGENT, NEED HELP!

Hello, I am wondering if all pods use AMD CPUs. I am on a 4 * A100-80G GPU instance in OR, and it uses AMD [EPYC 7763] CPU (it is extremely slow, 15 times slower than a normal intel Cascade lake CPU, I don't know if this is caused by container tech). Are there VMs that use intel CPUs and possibly different types of intel CPUs (like different Xeon Platinum in Cascade Lake) And how can I see CPU info before spinning up a VM? Thank you so much in advance!...