n00b multi gpu question

Hello hello!

I created a 4-GPU pod (screenshot), then asked PyTorch what devices it saw, and it only saw one. What's the dumb thing I'm missing?
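For reference, here is a minimal sketch of one way to ask PyTorch what it sees. The helper name `list_cuda_devices` is made up for illustration, and this is not necessarily the exact check that was run in the post; it only uses `torch.cuda.is_available()`, `torch.cuda.device_count()` and `torch.cuda.get_device_name()`.

```python
def list_cuda_devices():
    """Return the names of the CUDA devices PyTorch can see.

    Hypothetical helper for illustration. Returns an empty list when
    PyTorch is not installed or no CUDA device is visible.
    """
    try:
        import torch
    except ImportError:
        # PyTorch is missing from this environment, so nothing to report.
        return []
    if not torch.cuda.is_available():
        return []
    return [
        torch.cuda.get_device_name(i)
        for i in range(torch.cuda.device_count())
    ]


if __name__ == "__main__":
    names = list_cuda_devices()
    print(f"PyTorch sees {len(names)} CUDA device(s)")
    for i, name in enumerate(names):
        print(f"  cuda:{i} -> {name}")
```

On a healthy 4-GPU pod this would be expected to list four entries; a shorter list means PyTorch is seeing fewer devices than the pod was created with.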

Thanks

Solution

Alright, so I restarted the pod (with the env var you suggested) and CUDA reported zero GPUs.

Then I removed the env var, restarted again, and CUDA now reports four GPUs. Nothing else changed in the code or config.

Either:
- somehow the pip install commands messed up CUDA, and restarting fixed that
- RunPod is flaky about whether the GPUs actually get attached
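One way to tell those two cases apart next time is to compare what the driver reports with what PyTorch reports. The sketch below is only that, a sketch: the function names are made up, and since the thread doesn't say which env var was suggested, printing `CUDA_VISIBLE_DEVICES` is an assumption (it is just one common variable that changes which GPUs CUDA exposes). It relies on `nvidia-smi -L`, which prints one `GPU <n>: ...` line per device.

```python
import os
import subprocess


def count_smi_gpus(smi_output: str) -> int:
    """Count the 'GPU <n>: ...' lines printed by `nvidia-smi -L`."""
    return sum(1 for line in smi_output.splitlines() if line.startswith("GPU "))


def driver_gpu_count():
    """GPUs reported by the driver, or None if nvidia-smi is missing or fails."""
    try:
        result = subprocess.run(
            ["nvidia-smi", "-L"], capture_output=True, text=True, check=True
        )
    except (FileNotFoundError, subprocess.CalledProcessError):
        return None
    return count_smi_gpus(result.stdout)


def torch_gpu_count():
    """GPUs reported by PyTorch, or None if PyTorch is not installed."""
    try:
        import torch
    except ImportError:
        return None
    return torch.cuda.device_count()


if __name__ == "__main__":
    # Assumption: CUDA_VISIBLE_DEVICES is the variable worth checking here.
    print("CUDA_VISIBLE_DEVICES =", os.environ.get("CUDA_VISIBLE_DEVICES", "<unset>"))
    print("driver sees :", driver_gpu_count())
    print("PyTorch sees:", torch_gpu_count())
```

Roughly: if the driver count is already lower than the pod size, the GPUs probably never got attached; if the driver sees all four but PyTorch sees fewer, the problem is more likely on the software side (an env var or the installed CUDA/PyTorch build).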
