RunPod

R

RunPod

We're a community of enthusiasts, engineers, and enterprises, all sharing insights on AI, Machine Learning and GPUs!

Join

⚡|serverless

⛅|pods-clusters

pod terminate after command finishes

Hi folks -- it seems like if runpod notices that the entrypoint command for my pod finishes, it restarts the container and runs it again. is that expected, and is there any way to turn that off and have the pod terminate instead of re-running?

waiting for logs....

Hi, I wanted to start a RTX A 4000 pod with stable diffusion, but I got only "waiting for logs" for > 5 min... I've stopped after some time. is there an overload, or have I search the problem on my side. I'm new on runpod.io

Kohya_SS - Clicked "Start Training" button....how can i tell that it's working?

I'm running Kohya_ss through Runpod (via Stable Diffusion Kohya_ss ComfyUI Ultimate template). When I click "Start Training" the GUI gives me no indication that anything is happening. Because of how long this process takes it's hard to know whether an error happened or not. Everything I read seems to suggest that I should be able to see the training happening via the Terminal – if nothing else to confirm that activity is taking place and things are working. ...

GPU pods taking long time to install python packages

Horrible download speeds. it is actually disrupting my productivity as I have to wait for 2-3 hours for a few libraries to download. please look into this and kindly help
No description

GPU don't use

Use Runpod SD ComfyUI and ComfyUI - AI-Dock. But they don't use GPU. Secure Cloud 1 x RTX A4000. ID: h9zlhkckse9sx8. Region RO
No description

4090 GPUs in EU-RO-1 not available or with full memory

When starting up 4090s on this server region they get either stuck in "waiting for logs" before I can access them or their memory is full. please fix asap 🙏

While runnning a python file in my pod, I encounter a ModuleNotFound Error for tkinter

I have installed tkinter using pip and through apt-get install python3-tk . Still getting the error. I don't care about the gui as much as I care that my application completes (the application results are independent of gui tools). What can I do?

google colab image

I used the colab image available at us-docker.pkg.dev/colab-images/public/runtime:latest , the image works and gives the following logs , I added the port 9000 to the http port to expose in the pod settings , but it shows on the dialog after clicking on connect that the http service is not ready yet
No description

Switch off pod after 2 hours

Hello, I'm new with runpod, It seems like I didn't turn off my pod and it used up all my credit. How can I protect myself against it?...

Cannot open 7860 port with Oobabooga Text Generation WebUI template

I cannot open 7860 port with Oobabooga Text Generation WebUI template.

Need password when connect to pod using SSH

when I create a pod I try to connect to it using SSH, I follow the tutorial in the site Doc, when I try to connect it asked me for password, I even create an Ubuntu server pod and test it, it give me the same results asked me for SSH password. Can any one help me with this problem. my OS is Ubuntu 22.04...
Solution:
RunPod official templates have start.sh scripts that inject ssh keys into pod on community templates it’s different some of them add key some not. I made Python package that setups true ssh with password. pip install OhMyRunPod

jupyter notebook

Will connecting to the port of GPU instance show the same progress and jupyter notebook I am running or will it just start another instance with new jupyter notebook environment?

Super slow network speeds on some pods.

Some pods have really really slow network speeds and take an absolute age to install requirements initially, and then uploading / downloading files. §5 - 20 mbs takes around 10 minutes? How does one determine which of these pods has slow networks? As they are really a waste of time and money. I have to try one pod at a time to find one that is fast. This is not great.

Power capped

It appens a lot on french GPU's idk why buy the power is hard capped very low
No description

connect to pod 1 from pod 2

both pods have the same ip, but can't communicate to each other. Why?

How to change from root user?

Sorry for dumb question! I'm trying to start a project by installing packages within my venv but get a warning that i'm still the root user. I tried 'su - [username]' but couldn't find what to use as my username (tried a couple obvious things).
Solution:
you can ignore error about root

http service [port 7860] Not Ready

I broke runpod. NEW to LLM. Use runpod webui/terminal and huggingface. I receive this error regardless of GPU. I am using straight forward bloke 1 click ui. Its worked for weeks. Then recently - no. So this is EsinError/operator error. That is my error when I try to start the terminal.
I see this in Logs: AttributeError: module 'gradio.layouts' has no attribute 'all'...

runpodctl: start spot instance?

Is there a flag that can be used to create a spot instance with runpodctl? Or does it only create on-demand instances?

Upload speed issue

Im not waiting several hours to transfer 5Go of files
No description