Hello everyone. I am Dr. Furkan Gözükara. PhD Computer Engineer. SECourses is a dedicated YouTube channel for the following topics : Tech, AI, News, Science, Robotics, Singularity, ComfyUI, SwarmUI, ML, Artificial Intelligence, Humanoid Robots, Wan 2.2, FLUX, Krea, Qwen Image, VLMs, Stable Diffusion
Thanks for the response! They are high quality screenshots from a show with a unique art style, consistent in style only. All of them are uniquely captioned in the same format to describe the image, with a triggerword for the style at the start of the caption. I'll try as you suggested. One more question: Can I train 10 epochs, and with the resulting model, continue training for another 5 epochs in a separate instance (using the resulting model as a base) to get the same or similar result as if I were training 15 epochs from the start? A checkpoint per epoch would require me to regularly offload the models onto another server due to disk space, risking messing this up and epochs not saving.
Also, although maybe not particularly relevant. They are actually 2 sets of the same 900 images in different aspect ratios, according to my own testing and other testing I've seen, training on different aspect ratios improves results. Particularly with style finetunes. Never tried it with this many images though.
torch.OutOfMemoryError: CUDA out of memory. Tried to allocate 20.00 MiB. GPU 0 has a total capacity of 79.20 GiB of which 16.62 MiB is free. Process 2510593 has 79.18 GiB memory in use.,
training was successful on batch size=2 on h100 sxm This thing is demanding indeed, consumed like 77gb vram
I was wondering if you have a python code that I can refer/modify for finetuning instead of using the kohya gui? I want to load the parameters and train without having to open the interface and load the parameters all the time.
"I’d like to support this request. It would be beneficial if the developers could add support for the Chroma model.
Chroma is an 8.9B parameter model based on FLUX.1-schnell. More information can be found here: huggingface.co/lodestones/Chroma
Additionally, ai-toolkit and the diffusion-pipe project at github.com/tdrussell/diffusion-pipe already supports Chroma. (LoRA/fine-tuning).
Thank you for the fantastic sd-scripts project; your work is greatly appreciated!"
I came across this comment by blackmagix24. I will give diffusion pipe a shot. if you happen to test it in the future, I would love to see a video from you for chroma fine tuning along with a comparison with flux dev
yeah, when I tried chroma the images weren't as good as flux dev, but on the bright side chroma has apache license and I was wondering if finetuned well, will the results be in par with flux dev.