Dead instance on launch

GPU seems to have stopped...logs don't show any errors, but there is no activity
Migrate pod volume to Network volume
Unable to modify owner of network volume
/home inside the pod, attempting to create a user home dir. However, I am unable to change the owner away from root.Can't run extensions in stable diffusion
Cuda not connecting to image provisioned for GPU
Requests using RUNPOD_API_KEY fail with 403 unauthorized.
403 Forbidden and an empty response body....run commands remotely on my pod
$ runpodctl exec python /ru.py --pod_id <redacted>
Running remote Python shell...
Waiting for Pod to come online...
$ runpodctl exec python /ru.py --pod_id <redacted>
Running remote Python shell...
Waiting for Pod to come online...
Flux Gym
Http bad gateway error

LLM training process killed/SSH terminal disconnected, seemingly at random, no CUDA/OOM error in log
2 GPU but only one work
deploy fail, can't get template, networking, could not resolve host github.com
I can not do training out of memory error I got)
How to self terminate pod on crash
API endpoint
Having to re-download all models
Error while deserializing header: HeaderTooSmall
Trouble training sdxl lora with kohya
vLLM and multiple GPUs