Hello everyone. I am Dr. Furkan Gözükara. PhD Computer Engineer. SECourses is a dedicated YouTube channel for the following topics : Tech, AI, News, Science, Robotics, Singularity, ComfyUI, SwarmUI, ML, Artificial Intelligence, Humanoid Robots, Wan 2.2, FLUX, Krea, Qwen Image, VLMs, Stable Diffusion
i've got close to 3500 hours in on studying stable diffusion in the last 2.5 years. and close to 300 hours in on studying flux since it released. if not more
i literally spend an entire month sitting here, feeding it one single prompt at a time to see what it would return by default - walking through it's latent space
it's massively overfit, and it's been heavily doctored to deal with the 'woman laying in the grass' warping and shortening issue it shares with sd3-2b-medium.
sure, because it uses the t5xxl encoder nd the clip_l encoder. so you're going to get coherant. but give it a word like Umber and you should get mostly the color. the dictionary defintion for umber is "a natural pigment resembling but darker than ocher, normally dark yellowish-brown in color ( raw umber ) or dark brown when roasted ( burnt umber ). 2. a brownish-gray moth with coloring that resembles tree bark."
based on the blowup that he had a few weeks back, he'll refuse to admit he cna't do what he wants. he'll continue on till he gets 'something' and then boast about it
and it'll return some sorts of results, and people will believe he undistilled flux because they don't know any better. and blackforest will sit there and laugh at him
we get requests along those lines for Photoshop all the time "hi, i have this photo of this car, can you make it so i can read the license' - and the plate is just a black and white blur. 'um, no. this isn't camera raw footage, it's a jpg"
what could be a success is to make a model that is more flexible and more trainable, will not be the black forest pro model, will be other model that is more trainable
to do that, robin will need to scrap flux and start over. and that was at least a million, possibly more, to train. that's a huge hit now that he's nto at stability.ai
what he's most likely to do is continue to develop Pro - it's api only, it's making him money, he can sell it to top movie studios and commercial businesses
you can train with a very diverse people all genders all races all ages 3000 pictures at least for many epochs and may be end up more flexible and training on top produces lees bleeding, that will be a success for me, idk
not even robin can train it. he'd have to train the base model, and that's expensive. VERY expensive. hundreds of dollars at the minimum, but more like thousands of dollars if not more