CUDA out of memory (80GB GPU)
Hi there, I am trying to run a Dreambooth training through a serverless endpoint
Using the A100 80 GB GPU, is this perhaps not a good GPU for this type of training?
Using this template as a base, but I did modify it a bit, also modified the accelerate command with some other params but I wouldn't expect it to run out of memory..
https://github.com/runpod-workers/worker-lora_trainer
Using the A100 80 GB GPU, is this perhaps not a good GPU for this type of training?
Using this template as a base, but I did modify it a bit, also modified the accelerate command with some other params but I wouldn't expect it to run out of memory..
https://github.com/runpod-workers/worker-lora_trainer


GitHub
Contribute to runpod-workers/worker-lora_trainer development by creating an account on GitHub.
