Hello everyone. I am Dr. Furkan Gözükara. PhD Computer Engineer. SECourses is a dedicated YouTube channel for the following topics : Tech, AI, News, Science, Robotics, Singularity, ComfyUI, SwarmUI, ML, Artificial Intelligence, Humanoid Robots, Wan 2.2, FLUX, Krea, Qwen Image, VLMs, Stable Diffusion
i just spend a month walking through the flux dev and flux pro latent space, one token at a time. what was done to flux to hide the warping and shortening issues that everyone saw with the sd3-2b-medium release (because flux is the exact same neural network and setup) is really ugly, and undoable
I trained flux with some big datasets off 3500 images for 40 epochs and I hade great results but is for a style and some new concepts, is unable to train many specific individuals consistently
you'll get 'something' - but you won't get something that actually updates flux's weights the way a training is supposed to. all you're going to get is something that overwrites what the AI knows. the results you'd get would be different if it was actually training the model
that's what a LoRA is for - to allow you to add information and update the weights for specific things without retraining the entire model. but what you're getting is about the same as if someone took a piece of paper, drew on it, and pasted it over a photo. you're not changing the photo, you're just putting somethign over it.
and because flux IS NOT trainable, regardless of what you keep seeing people say, when you try to train it for things that do require those weights to be updated, it can't happen
also, i'll be very surprised if black forest releases any updates to dev or schnell anytime soon. the money is in pro, and the people that are paying 5 cents a generation for it's API
let's see, for know it works for my purpose, I can train a style and some concepts and on top a character at a time, that works perfect for me, is tedious because I need a lora for each subject but is a workaround
they couldn't fix the issues it has, so they did stuff to mask them, and that makes it almost unusuable for anything other than exactly what they released, with a very narrow range of what it can successfully do
I haven't spoken much about my ongoing project to de-distill schnell to make a permissive licensed version of flux, but I have been updating it periodically as it trains. I just noticed it is the #2 trending text-to-image model on Hugging Face. Working on aesthetic tuning now.
it's sort of like this - if you mix up flour, water, eggs, chocolate chips, sugar, baking soda, salt - you have a batter that can be turned into a lot of things. YOu could make a chocolate chip cake or cookies at this point. if you then bake cookies, you have cookies. and no one can come along and turn those cookies back into batter.
you can then shape the mush into some other shape, and say you turned it back into batter and now you've baked a cake or something - but you didn't actually do that. you just made mush and reshaped cookies into cake shaped cookies that now don't taste all that good
eh, he'll keep working on it till he gets something that he'll then prance around and boast about. you'd think HE would know better - but he's backed himself into a corner