Hello everyone. I am Dr. Furkan Gözükara. PhD Computer Engineer. SECourses is a dedicated YouTube channel for the following topics : Tech, AI, News, Science, Robotics, Singularity, ComfyUI, SwarmUI, ML, Artificial Intelligence, Humanoid Robots, Wan 2.2, FLUX, Krea, Qwen Image, VLMs, Stable Diffusion
lol i publish thousands of dollars worth article, lots of research info and still some people complains like why i use lora rank 128 instead of 32 you can use if you wish
According to chatgpt it tells me that my image is a “Countryball”. but using some caption models trained in replicate or huggingface it tells me it's a “cartoon illustration”.
I have a doubt, and excuse my ignorance.... but when using the prompt is it necessary to write “cbznft cartoon illustration”? i.e. use both? or am I wrong?
stupid question: should or can training images also contain things to learn which are rare? like 98% of all the training images of the same person have blue eyes -- and 2% have green eyes (like they wearing colored contact lenses)?
@Dr. Furkan Gözükara I am going to try without *.txt captions, 800 images 1024x1024, what epoch should I use? I think there is a formula, and how often the checkpoint?
I agree, it adds a bit of grain to the images. It is also very acceptable for multiple loras. I have used like 5 with it together, and it worked marvellously.