Runpod•14mo ago

Runpod GPU use when using a docker image built on mac

I am building serverless applications that are supposed to be using gpu, while testing locally, the pieces that kick off functions that are meant to be using gpu are denoted with the common:

device: str = "cuda" if th.cuda.is_available() else "cpu"

this is required so that when running locally on a mac, the cpu device is used. I would think that in a docker image built on a mac, but with a amd64 machine type specified in the build command, that when its deployed on a server that has a cuda base image, cuda gpu would be used. but that does not seem to be the case.

I have not been able to understand why that is for the longest time. My runpod serverless pods only show cpu usage when tested.

Any advice?

Jason•10/24/24, 12:19 PM

Try to test with cpu pods only and if it's the case ( it's only using cpu )

Jason•10/24/24, 12:20 PM

If it takes longer to use cpu pods (10 cores ish) probably it isn't using cpu only, and if it's the same time as gpu with cpu then yeah there might be a problem

Jason•10/24/24, 12:20 PM

The stats isn't updated that often so it may be not accurate

zfmoodydubOP•10/24/24, 12:21 PM

good advice thank you. Ive tried to deploy the same image to cpu only pods using a heavy duty cpu but the same image fails to initialize in a cpu pod. probably because im using this base image:

runpod/base:0.4.0-cuda11.8.0

Jason•10/24/24, 12:22 PM

Is it only the Stat from the website or you actually tested to print inside that logic (if)

Jason•10/24/24, 12:22 PM

Maybe you can try that too to make sure if it detects Nvidia gpu

Jason•10/24/24, 12:23 PM

Make sure to have Cuda inside your image, or use nvidia's base image from (ngc) search in Google Nvidia ngc

zfmoodydubOP•10/24/24, 12:23 PM

i do have a device type print log after most of those declarations, and it always says using cpu

zfmoodydubOP•10/24/24, 12:24 PM

sorry m8 this one goes over my head a bit:

"Make sure to have Cuda inside your image, or use nvidia's base image from (ngc) search in Google Nvidia ngc"

I thought i would have cuda inside my image via the base image name...

need to study up on what you mean by that

zfmoodydubOP•10/24/24, 12:24 PM

thanks for the direction

Jason•10/24/24, 12:25 PM

What's ur base image?

zfmoodydubOP•10/24/24, 12:25 PM

runpod/base:0.4.0-cuda11.8.0

Jason•10/24/24, 12:25 PM

I think it has Cuda already yep

Zzfmoodydub I am building serverless applications that are supposed to be using gpu, while t...

Jason•10/24/24, 12:26 PM

What is th object here? From th.cuda.is_...

zfmoodydubOP•10/24/24, 12:33 PM

sorry @nerdylive i am not sure the answer to your question. that declaration is littered throughout some open source code multiple times.

Jason•10/24/24, 12:34 PM

I'm not sure what's causing the problem here

Jason•10/24/24, 12:34 PM

Can you check the logs of the worker that ran

Jason•10/24/24, 12:34 PM

Any errors maybe? Showing that gpu or Cuda isn't available

zfmoodydubOP•10/24/24, 12:38 PM

when i run a process in a particular pod that im seeing the issue, it does say that "cuda is not available using cpu" but another serverless pod (the one im talking to you about in a different thread) using the same base image does not have this problem. so i believe this is an internal code thing in my repository.

after noticing that, i do not think this is a runpod problem. I can dig deeper there. thanks for your responses!

Jason•10/24/24, 12:41 PM

Hmm weird

Jason•10/24/24, 12:41 PM

Maybe use another pod, try

Jason•10/24/24, 12:41 PM

Have you?

zfmoodydubOP•10/24/24, 1:06 PM

i have not, not the most seasoned engineer, and havent had much luck successfully deploying my apps with anything else but with this base image:

runpod/base:0.4.0-cuda11.8.0

so ive really only been using that base image for my apps for about a year

Jason•10/24/24, 1:17 PM

Ooh yeah I mean re deploy it in another pod maybe it works on another machine