How to tell how much storage being used in pod? (including network drive)
Can't see training progress after reset

Maintenance - only a Community Cloud issue?

SDK GPU naming specification
How to get a general idea for max volume size on secure cloud?
Template pytorch-1.13.1 lists cuda 11.7.1 version but is actually cuda 11.8?
nvcc --version
and it said the pod was on cuda version 11.8.
Is this normal or an error? I can't seem to run my model because of the cuda version mismatch. For reference, I'm using A40....Can't connect to sfpt
Unable to ssh onto my pod with the public key already on the runpod server
Python modules missing when pod is starting
Unable to connect to Pod

I am having trouble finding the location of the model file when trying to use ComfyUI.

Turn on Confidential Computing
"SSH Public Keys" in account settings are completely ignored
PUBLIC_KEY
variable, that I specify for my pod from my python app. However they are only set when I am connecting with ssh via proxy server. Proxy is extremely slow and does not allow scp to be run through it. When I try to connect directly (via public ip), the ~/.ssh/authorized_keys
is not configured at all with the public key I set in the settings. The env vars that I pass during the pod creation are also missing.
Two problems:
- why isn't the ~/.ssh/authorized_keys
file created and populated with my public key from account settings
- why env variables are missing when connecting directly via public ip to my instance? I assume proxy has some .bashrc which is activated when I connect through it, but why the env vars are not set with -e
parameter in docker run command for the pod?...xargs -0 -L1 -a /proc/1/environ
will list the environment variables that the process is getting, which is launched on container start. if there is a PUBLIC_KEY given to your pod, it will be there. if this process is a bash and doesn't export those variables when starting other processes, it will be the only process who knows about your PUBLIC_KEYIs there an instance type that cannot be taken from you even if you stop the pod?
Kill a pod from the inside?
Performance A100-SXM4-40GB vs A100-SXM4-80GB
API problem
Why is there no indicators of file transfer operations? Am I supposed to guess when they're done?
data didn't persist
Tailscale on Pod
/dev/net/tun
is not available, and using SOCKS5 proxy as described in this article https://tailscale.com/kb/1112/userspace-networking is not an option for us.
Are there any recommendations, how I can run Tailscale?
...