Failed to initialize NVML: Unknown Error

(compress) root@1908bfec7b85:/workspace# nvidia-smi
Failed to initialize NVML: Unknown Error
Failed to initialize NVML: Unknown Error
Every hour or so on my runpod instance, I get the above nvidia error. I'm not changing anything with the machine -- I have to restart it to fix it. Any ideas? Thanks
3 Replies
Unknown User
Unknown User16mo ago
Message Not Public
Sign In & Join Server To View
yhlong00000
yhlong0000016mo ago
looks like something need to be changed for the host server, might want to create a support ticket with pod id and attach the link nerdylive just post, hope they can fix it.
Unknown User
Unknown User16mo ago
Message Not Public
Sign In & Join Server To View

Did you find this page helpful?