nielsrolf
RunPod
Created by nielsrolf on 4/6/2025 in #⛅|pods-clusters
Uncorrectable ECC error encountered
No description
4 replies
RunPod
Created by nielsrolf on 3/28/2025 in #⛅|pods-clusters
Model upload to huggingface is so slow it costs more than training
No description
7 replies
RunPod
Created by nielsrolf on 11/19/2024 in #⛅|pods-clusters
Starting a pod with runpod/pytorch:2.4.0-py3.11-cuda12.4.1-devel-ubuntu22.04 has cuda version 12.6
I am confused about what determines the CUDA version of a pod I start. I would expect a Docker image with a CUDA version in its name to have that version bundled, so that it is the version I see once the pod is running, but this is not the case. How can I start a pod with a predictable CUDA version?
6 replies
RunPod
Created by nielsrolf on 11/12/2024 in #⚡|serverless
Incredibly long startup time when running 70b models via vllm
No description
11 replies