RunPod
quang son

Runpod error starting container

2024-03-07T14:40:19Z error starting container: Error response from daemon: failed to create task for container: failed to create shim task: OCI runtime create failed: runc create failed: unable to start container process: error during container init: error running hook #0: error running hook: exit status 1, stdout: , stderr: Auto-detected mode as 'legacy'
Inconsistency detected by ld.so: ../sysdeps/x86_64/dl-machine.h: 534: elf_machine_rela_relative: Assertion `ELFW(R_TYPE) (reloc->r_info) == R_X86_64_RELATIVE' failed!
nvidia-container-cli: detection error: driver rpc error: failed to process request: unknown

I restarted the pod, but the error persists.
Madiator2011 40d ago
Can you provide more information?
ashleyk 40d ago
The pod ID would also be useful.
quang son 40d ago
The pod ID is 7k5m1uf3rz4yoy, in EU-RO-1.
Satish 40d ago
It looks like there is a hardware issue; I'm checking with the DC team on this. If you are using a network volume, first create a new pod and verify that your data is accessible, then terminate the old pod. The new pod will be placed on a different host. If you are using local storage, we need to wait until the hardware issue is fixed.
quang son 34d ago
Yes, we have terminated it. Another issue, @Satish @Papa Madiator:
2024-03-13T15:30:27Z start container
2024-03-13T15:30:30Z error starting container: Error response from daemon: driver failed programming external connectivity on endpoint 4bjf5n4vpbv46f-0 (8de4860d317dc036a9e7527a00d592ee7d1a29b8262ac119438c8b579757f7c4): Error starting userland proxy: listen tcp4 0.0.0.0:40168: bind: address already in use
2024-03-13T15:30:31Z start container
2024-03-13T15:30:32Z error starting container: Error response from daemon: driver failed programming external connectivity on endpoint 4bjf5n4vpbv46f-0 (507df3a0f7d63f508f43d9709b724576e2ef3dac3f7ebda732741e3e311e4822): Error starting userland proxy: listen tcp4 0.0.0.0:40168: bind: address already in use
2024-03-13T15:30:49Z start container
2024-03-13T15:30:52Z error starting container: Error response from daemon: driver failed programming external connectivity on endpoint 4bjf5n4vpbv46f-0 (1cd76168203a4e67ac7c0dd9e6b70325d35ddb46d6dda00f388454475bbb55d0): Error starting userland proxy: listen tcp4 0.0.0.0:40168: bind: address already in use
2024-03-13T15:31:07Z start container
Madiator2011 34d ago
Port 40168 is already in use. Is this serverless or a pod?
quang son 34d ago
It is a RunPod pod in GPU Cloud. Pod ID: 4bjf5n4vpbv46f. I have already terminated it.
Madiator2011 34d ago
If you terminated it, we won't be able to tell what the issue was 😄
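For context, "bind: address already in use" in the log above means something on the host was already listening on port 40168 when Docker tried to publish it. Pod users normally cannot inspect the host, so the following is only a minimal sketch of the underlying check, assuming you have a shell on a machine where you want to test a port. The function name `port_is_free` is hypothetical, not part of any RunPod or Docker tooling.

```python
import socket


def port_is_free(port: int, host: str = "0.0.0.0") -> bool:
    """Return True if a TCP socket can bind to (host, port) right now."""
    with socket.socket(socket.AF_INET, socket.SOCK_STREAM) as sock:
        try:
            # bind() raises OSError (EADDRINUSE) when another socket holds the port
            sock.bind((host, port))
        except OSError:
            return False
        return True


if __name__ == "__main__":
    # 40168 is the port from the error log in this thread
    print("port 40168 free:", port_is_free(40168))
```

A result of False only says the port is taken at that moment; it does not say which process holds it.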
quang son 19d ago
OK. If I see this issue again, I'll notify you and won't terminate the pod, so you can debug on your side.

Hi @Madiator2011, pod ID 9rb76yk8uvvy0q has issues:
2024-03-29T04:17:37Z Status: Image is up to date for runpod/stable-diffusion:web-ui-10.2.1
2024-03-29T04:17:47Z create container runpod/stable-diffusion:web-ui-10.2.1
2024-03-29T04:17:55Z pending image pull runpod/stable-diffusion:web-ui-10.2.1
2024-03-29T04:18:07Z error pulling image: Error response from daemon: Get "https://registry-1.docker.io/v2/": net/http: request canceled while waiting for connection (Client.Timeout exceeded while awaiting headers)
cc @Satish
Madiator2011 18d ago
Have you tried restarting the pod?
quang son 17d ago
Yes, I have already restarted the pod:
2024-03-29T15:42:33Z create pod network
2024-03-29T15:42:33Z create container runpod/stable-diffusion:web-ui-10.2.1
2024-03-29T15:42:41Z pending image pull runpod/stable-diffusion:web-ui-10.2.1
2024-03-29T15:42:54Z error pulling image: Error response from daemon: Head "https://registry-1.docker.io/v2/runpod/stable-diffusion/manifests/web-ui-10.2.1": Get "https://auth.docker.io/token?scope=repository%3Arunpod%2Fstable-diffusion%3Apull&service=registry.docker.io": net/http: request canceled while waiting for connection (Client.Timeout exceeded while awaiting headers)

PodID: rcdd36seu3owfw
I think zone EU-SE-1 has a problem. Please take a look.

PodID: ive5fzoddxzfbb still has the error:
2024-03-29T23:17:01Z create container runpod/stable-diffusion:web-ui-10.2.1
2024-03-29T23:17:09Z pending image pull runpod/stable-diffusion:web-ui-10.2.1
2024-03-29T23:17:16Z error pulling image: Error response from daemon: Get "https://registry-1.docker.io/v2/": net/http: request canceled while waiting for connection (Client.Timeout exceeded while awaiting headers)

PodID: imw1s5o3l2tyhs still has the error:
2024-03-30T09:06:19Z error pulling image: Error response from daemon: Head "https://registry-1.docker.io/v2/runpod/stable-diffusion/manifests/web-ui-10.2.1": Get "https://auth.docker.io/token?scope=repository%3Arunpod%2Fstable-diffusion%3Apull&service=registry.docker.io": net/http: request canceled while waiting for connection (Client.Timeout exceeded while awaiting headers)
2024-03-30T09:06:36Z create container runpod/stable-diffusion:web-ui-10.2.1
2024-03-30T09:06:44Z pending image pull runpod/stable-diffusion:web-ui-10.2.1
2024-03-30T09:07:07Z web-ui-10.2.1 Pulling from runpod/stable-diffusion
2024-03-30T09:07:07Z Digest: sha256:315f00cc67b03de0e04f33f9a3650a3bd019cd4a48c2c7d95060bc5d140e619b

PodID: p2oe90065eiswb still has the error.
I have many issues in zone EU-SE-1. Could an admin please help me check? @Satish @Papa Madiator
This issue affects many users of our production service. Please fix it. @Polar, please help me check it. I raised this issue yesterday, but no one has responded yet 😦
haris 17d ago
Hi, I'm not sure what the issue is here. Can you contact support through our site? It should be at the bottom right of our dashboard (the purple icon).
quang son 17d ago
I have contacted support, but only a bot replied. No one has helped.
Satish 16d ago
@quang son I don't see the pod p2oe90065eiswb now. According to the logs, it failed to download the Docker image. The server where the pod was being created now has other images running fine. Did you try creating the pod again in another region? I will try running the same image on another machine and let you know if I find anything.
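The pull errors quoted in this thread are all client timeouts while connecting to registry-1.docker.io or auth.docker.io, which points at the network path rather than the image itself. As a rough way to test that path from a machine you do control (for example a running pod in the same region), here is a minimal sketch assuming Python 3 is available there. The name `probe_registry` and its `opener` parameter are hypothetical helpers for illustration. An unauthenticated request to the registry's /v2/ endpoint typically gets an HTTP 401, which still shows the host is reachable.

```python
import urllib.error
import urllib.request


def probe_registry(url="https://registry-1.docker.io/v2/", timeout=10.0,
                   opener=urllib.request.urlopen):
    """Return (reachable, detail) for one HTTP GET against url."""
    try:
        with opener(url, timeout=timeout) as resp:
            return True, f"HTTP {resp.status}"
    except urllib.error.HTTPError as exc:
        # Any HTTP status (401 without a token, for instance) means the
        # connection itself succeeded, so the registry is reachable.
        return True, f"HTTP {exc.code}"
    except OSError as exc:
        # Covers timeouts, DNS failures and refused connections
        # (URLError and TimeoutError are OSError subclasses).
        return False, f"{type(exc).__name__}: {exc}"


if __name__ == "__main__":
    for target in ("https://registry-1.docker.io/v2/", "https://auth.docker.io/token"):
        print(target, probe_registry(target))
```

If this reports an unreachable registry from one region but works from another, that supports the suggestion above to retry the pod in a different region.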