Runpod · 2y ago
7 replies
smoke

CUDA out of memory (80GB GPU)

Hi there, I'm trying to run a DreamBooth training job through a serverless endpoint, and it fails with a CUDA out-of-memory error.

I'm using the A100 80 GB GPU. Is this perhaps not a good GPU for this type of training?

I'm using this template as a base. I did modify it a bit, and I also changed the accelerate command to pass some other params, but I wouldn't expect that to make it run out of memory:
https://github.com/runpod-workers/worker-lora_trainer