Hello everyone. I am Dr. Furkan Gözükara, PhD Computer Engineer. SECourses is a dedicated YouTube channel for the following topics: Tech, AI, News, Science, Robotics, Singularity, ComfyUI, SwarmUI, ML, Artificial Intelligence, Humanoid Robots, Wan 2.2, FLUX, Krea, Qwen Image, VLMs, Stable Diffusion
Yeah, I'm just trying to figure out if that's something I can trade off to cut training time. On a 4090 with tier_4 (the lowest of the 4 configs) at 200 epochs, I'm getting a 5.5 hr ETA for training on 30 images for FLUX. Intermediate checkpoints are cooking well, but the time it wants to take is a bit of a bummer.
Thank you for telling me the obvious better solution; it's good to be reminded of that. A lot of this is time-intensive, but I am seeking to optimize where I can, which is the basis for the original query.
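For a rough sanity check on the 4090 numbers quoted above (30 images, 200 epochs, 5.5 hr ETA), here is a minimal sketch of the arithmetic. It assumes batch size 1 and no dataset repeats, neither of which is stated in the message, and the function name `estimate_step_time` is just a hypothetical helper, not part of any trainer.

```python
import math

def estimate_step_time(num_images, epochs, eta_hours, batch_size=1, repeats=1):
    """Return (total_steps, seconds_per_step) for a quoted training ETA."""
    # Assumption: one step processes one batch, and a partial last batch
    # still counts as a full step.
    steps_per_epoch = math.ceil(num_images * repeats / batch_size)
    total_steps = steps_per_epoch * epochs
    seconds_per_step = eta_hours * 3600 / total_steps
    return total_steps, seconds_per_step

# Numbers from the message; batch_size=1 and repeats=1 are assumed.
total_steps, sec_per_step = estimate_step_time(num_images=30, epochs=200, eta_hours=5.5)
print(total_steps, round(sec_per_step, 2))  # 6000 3.3
```

Under those assumptions the ETA works out to about 3.3 seconds per step, so cutting the epoch count is the most direct lever: half the epochs means roughly half the wall-clock time at the same step speed.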
Would love to hear about it if you find a custom GPT or LLM instructions that reliably understand how FLUX likes to be prompted. Often it feels like they just shotgun a cloud of text at the CLIP model.
I mean, the answer lies in the datasets of both the T5 and CLIP models. If we had access to those and trained an LLM to give prompts based on them, it would be dope.
Anyone around here have some experience with training LoRAs? I want to learn how to make them, but I'm not sure what the most effective method is, and it seems like there are quite a few ways to do it. Anyone have a recommended way? I've been using ComfyUI pretty much exclusively for image gen, and if the quality is equally good, I'd prefer Comfy.
@Dr. Furkan Gözükara I'm also training a style LoRA to get very realistic images. I have seen some LoRAs like this on Civitai: https://civitai.com/models/652699/amateur-photography-flux-dev I'm trying to train for this style but have been struggling to get the skin texture right. I tried with 150 images, 8000 steps, batch size 2, rank 64. I also added 10 face close-ups to get some skin texture. I mostly sourced images from a friend who is a photographer, and then from stock image websites. What am I doing wrong? Is my dataset too small, or am I still picking clean-looking images when I should use more images actually taken with a phone?
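To put the settings above (150 images, 8000 steps, batch size 2) on the same footing as epoch-based configs, here is a small sketch that converts steps into passes over the dataset. It assumes each step consumes exactly one batch with no gradient accumulation or dataset repeats, and `passes_per_image` is a hypothetical helper name. The message does not say whether the 10 close-ups are part of the 150, so both cases are shown.

```python
def passes_per_image(steps, batch_size, num_images):
    """Approximate number of times each image is seen during training."""
    # Assumption: every step draws one full batch and images are sampled evenly.
    samples_seen = steps * batch_size
    return samples_seen / num_images

# Close-ups counted inside the 150 images.
print(round(passes_per_image(8000, 2, 150), 1))  # 106.7
# Close-ups counted on top of the 150 images (160 total).
print(passes_per_image(8000, 2, 160))  # 100.0
```

Either way the run amounts to roughly 100 passes per image under these assumptions, which makes it easier to compare against configs specified in epochs.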
Stability AI published their newest and most powerful model, Stable Diffusion 3.5 Large. Unlike FLUX, this model is a full model, not distilled, and has huge potential. I have done extensive research and am publishing all of it in this video on how to use SD 3.5 Large with the best settings. Moreover, I am sharing how to use FLUX DEV with the best possib...
Ultimate Kohya GUI FLUX LoRA training tutorial. This tutorial is the product of 9 days of non-stop research and training. I have trained over 73 FLUX LoRA models and analyzed all of them to prepare this tutorial video. The research is still going on; hopefully the results will be significantly improved, and the latest configs and findings will be shared. Please wat...
Unlock the power of FLUX LoRA training, even if you're short on GPUs or looking to boost speed and scale! This comprehensive guide takes you from novice to expert, showing you how to use Kohya GUI for creating top-notch FLUX LoRAs in the cloud. We'll cover everything: maximizing quality, optimizing speed, and finding the best deals. With our exc...