R
RunPod4mo ago
letajmal

What is the recommended System Req for Building Worker Base Image

I was trying to build a custom runpod/worker-vllm:base-0.3.1-cuda${WORKER_CUDA_VERSION} image, but my 16vCPU, 64GB RAM server crashed. What is the recommended system spec for this purpose
13 Replies
ashleyk
ashleyk4mo ago
@Alpay Ariyak can probably advise on this and maybe add it to the docs.
Alpay Ariyak
Alpay Ariyak4mo ago
Are you trying to build the worker image or the vllm base image
letajmal
letajmal4mo ago
vllm base image can i build using runpod cloud, i can understand these are containers, but is there any docker in docker kind of setup
ashleyk
ashleyk4mo ago
You can only use docker on CPU pods not on GPU pods
letajmal
letajmal4mo ago
since CPU pods are not that powerful, they are useless in this context
ashleyk
ashleyk4mo ago
You can use CPU pods to build images, because people request the ability to do so but yes they obviously don't have a GPU, you can't use docker on GPU pods for security reasons.
letajmal
letajmal3mo ago
Let me try, thanks for the info I have added a template for docker.io/docker but i am not able to start a web terminal session
ashleyk
ashleyk3mo ago
Check the logs for your workers by clicking on one of them, you can only access the web terminal while a worker is actually running
letajmal
letajmal3mo ago
I was able to build the image(CUDA 11.8) using a GCE VM 24vCPU, 128 GB. I am not sure if this is required, i was using the max cap in gcp free tier Also docker.io/docker image was not working because bash was not installed, so the ssh connection was failing If someone is trying to use runpod CPU/GPU for this purpose use some ubuntu dind image or github actions runner image by the community for CUDA 12.1, this configuration is not enough
ashleyk
ashleyk3mo ago
Which configuration?
Also docker.io/docker image was not working because bash was not installed, so the ssh connection was failing
Sounds like you're using a bad base image, bash is always usually installed.
letajmal
letajmal3mo ago
24vCPU, 128 GB is not enough for CUDA 12.1
ashleyk
ashleyk3mo ago
The vCPU and system memory have nothing to do with the CUDA version, the system requirements depend on your application, not the CUDA version.
Alpay Ariyak
Alpay Ariyak3mo ago
Add --build-arg max_jobs=2 --build-arg nvcc_threads=8
Want results from more Discord servers?
Add your server
More Posts