RunpodR
Runpod•2y ago
David Mack

n00b multi gpu question

Hello hello!

I created a 4 gpu pod (screenshot), then asked pytorch what devices it saw, and it just saw one - what's the dumb thing i'm missing?

Thanks 🙂
Screenshot_2024-06-11_at_5.49.17_PM.png
Screenshot_2024-06-11_at_5.49.23_PM.png
Solution
Alright so, I restarted the pod (with the env var you suggested) and CUDA reported zero gpus

Then I removed the env var, restarted, and CUDA now reports four GPUS. no change from previous code/config

Either:
  • somehow the pip install commands messed up CUDA, and restarting fixed that
  • runpod is flakey on if the gpus get attached or not
Was this page helpful?