Failed to initialize NVML: Unknown Error

I'm not sure whether I'm doing something wrong or what is happening, but every 20-30 minutes or so the pod restarts, and it seems like I lose connection with the GPU until I restart the pod manually. Running an RTX 5090.
1 Reply
Dj, 2w ago
This is interesting. Are you still seeing this problem?
