Hi there, I am trying to run a Dreambooth training through a serverless endpoint
Using the A100 80 GB GPU, is this perhaps not a good GPU for this type of training?
Using this template as a base, but I did modify it a bit, also modified the accelerate command with some other params but I wouldn't expect it to run out of memory.. https://github.com/runpod-workers/worker-lora_trainer