solanotodeschini
RunPod
•Created by solanotodeschini on 4/2/2025 in #⛅|pods-clusters
Axolotl Fine Tune Error (flash_attn_2_cuda.cpython-311-x86_64-linux-gnu.so: undefined symbol)
Hi! I had been using the axolotl image for fine-tuning successfully, but now I'm getting this error:
flash_attn_2_cuda.cpython-311-x86_64-linux-gnu.so: undefined symbol
Everything was working normally until yesterday. I'm following the steps in the fine-tuning tutorial: https://docs.runpod.io/tutorials/pods/fine-tune-llm-axolotl#using-a-hugging-face-dataset
1 reply
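An "undefined symbol" failure when loading `flash_attn_2_cuda` often (though not always) points to a flash-attn binary that was built against a different PyTorch or CUDA build than the one currently installed in the image. A minimal diagnostic sketch, assuming Python is available inside the pod: the helper names `try_import` and `report` are hypothetical, and the snippet only reports what imports and what fails; it does not fix anything.

```python
import importlib


def try_import(module_name):
    """Return (ok, detail): a version string on success, the error text on failure."""
    try:
        module = importlib.import_module(module_name)
    except Exception as exc:
        # Loader failures such as "undefined symbol" surface as ImportError.
        return False, f"{type(exc).__name__}: {exc}"
    return True, str(getattr(module, "__version__", "unknown"))


def report(module_names=("torch", "flash_attn")):
    """Try each module (hypothetical default list) and collect the results."""
    return {name: try_import(name) for name in module_names}
```

Calling `report()` inside the pod and comparing the reported `torch` version with the one the flash-attn wheel was built for may narrow down whether the image changed underneath the tutorial steps.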
RunPod
•Created by solanotodeschini on 12/19/2024 in #⚡|serverless
Worker not executing job
Hi! I'm trying to deploy a simple serverless worker for PDF processing that requires GPUs. I followed the docs and was able to build the image, but when I send a request, it stays stuck indefinitely and the workers exit.
I'm having a hard time debugging it since it takes more than half an hour to build and upload the image.
Dockerfile:
3 replies