Hello everyone. I am Dr. Furkan Gözükara. PhD Computer Engineer. SECourses is a dedicated YouTube channel for the following topics : Tech, AI, News, Science, Robotics, Singularity, ComfyUI, SwarmUI, ML, Artificial Intelligence, Humanoid Robots, Wan 2.2, FLUX, Krea, Qwen Image, VLMs, Stable Diffusion
I've used your Flux dreambooth training presets and the results are amazing. Would training at 1280 x 1280 increase the quality even further or is that useless to try?
I usually do LoRAs over full dreambooth trainings. did one LoRA at 1536x1536. on my 3090 i did 150 epochs on 20 images and it took 22 hours same dataset and settings at 1024x1024 had only taken 7 hours. (i think the configs and kohya itself have both been optimized a lot since then) i do feel like the 1536 LoRA is better, more detailed. but after that i stuck with 1024 - didn't feel like the improvement (it's usually subjective by then anyway with character LoRAs from only epoch 100-125 turning out awesome as well) were worth triple the training time.
I trained 59 images at 1024 x 1024 190 epochs (of a complex artstyle) and it works great even at high resolutions like 2000x2500 or something without upscaling. So I'm really curious to see if training at 1280x1280 would offer any improvements at high resolutions. I'll let you know whenever it finishes.
Some things look better, some things look worse. I think it's not really worth it... It seems to add more detail to small text or backgrounds, but the characters look worse
I rented massed compute first time after few month off since i have rtx 4090... I though generating wan2.1 will be much faster....lol.. definitely not on L40S lol slow.. Maybe H100, but to expensive..
man I'm trying to do dreamboothsdxl training to the likeness of an individual and I've tried so many times and the sample output images continue sucking. I subscribed to the Patreon, can anyone point me towards the most useful pages re: optimal parameters and so on?
If your training data set is 59 images and each is being repeated 10 times, how many regularization images is ideal? Also, is repeating 10 times the best number of repeats?
If you want to train FLUX with maximum possible quality, this is the tutorial looking for. In this comprehensive tutorial, you will learn how to install Kohya GUI and use it to fully Fine-Tune / DreamBooth FLUX model. After that how to use SwarmUI to compare generated checkpoints / models and find the very best one to generate most amazing image...
Are there any special instructions or 'Additional parameters' to enter into the Kohya_ss GUI when trying to "Resume from saved training state (path to "last-state" state folder)?"
I have entered the path/workspace/stable-diffusion-webui-forge/models/Stable-diffusion/fruit_realismByStableYogi_v5XLFP16/model/ohwx-woman_Fruit_Full-finetune_adafactor_realismIllustriousBy_v35FP16_292-Img-200-Instances-step00032000-state
I have verified the contents within that folder include all of the expected, model_1, model, optimizer, random_states, scheduler.
The training starts, however, it starts at step 01, versus step 32001 as I intend to continue traning.
Can anyone offer insight or help here? Thank you in advance!
Thanks! i've just upgraded my setups from 4090 to 5090 and noticing that I have problems running pretty much everything I tried before thanks again! Now i think im getting it
Hey, I wonder, has anyone tried / have article or any sort of content about full checkpoint finetuning on Wan ? I really want to get my hands into it but it'd be great to have some references to start working on it
What are the optimal sdxl fine tune and lora configs? Having a hard time finding them. Using massedcompute. Onetrainer or dreambooth? Was thinking of using big love xl 2.5 as base checkpoint. Like the realism and quality of pics there.