Hello everyone. I am Dr. Furkan Gözükara. PhD Computer Engineer. SECourses is a dedicated YouTube channel for the following topics : Tech, AI, News, Science, Robotics, Singularity, ComfyUI, SwarmUI, ML, Artificial Intelligence, Humanoid Robots, Wan 2.2, FLUX, Krea, Qwen Image, VLMs, Stable Diffusion
@Dr. Furkan Gözükara Here a gen from a custom fine tune on large images I’ve tiled and cut into 1024x1024 pixel chunk separate pngs to capture every tiny possible detail from the up to 6K large images. It was an experiment and it went well with your config. Original image count was 15 for those large images and then after filing there were over 400 images. So this seems to be the trick to be able to train on any resolution images without losing any details.
made a new flux fine tuning model of my friend with 58 images, also gave great and trustworthy results. im wondering how low it can go. my first attempt with 15 images was too glass-skin like. but yea at least now i know that 58 images works well!
I've written a python script with chatGPT that will tile them with overlap and is also including the original full size image. Plus I'm using flux dev de-distilled.
@julius so yeah the fine details the model can't train on when only the full image is trained on it's own since the image gets automatically downscaled to 1024x1024 so the training will never see the full details in your dataset. I've taken a step further and have 3 different resolutions for the original image in 2x increments so that it will have 1024x1024 which is the full image downscaled to give the training the full context of the subject then a 2048x2048 version which will serve for 2x Ultimate SD Upscale training on 1024x tiles and then a 4096x4096 size tiled in 1024x1024 tiles to serve as data for training for 4x Ultimate SD Upscale and lastly the original resolution whatever it is to be tiled into 1024x1024 tiles with overlap to not miss any detail from the original dataset.
Has anyone used the nee deep face similarity app? It works great but I'm not sure what sort of numbers are considered very similar in terms of comparison. I know lower is better but is .3 good or still very off for instance?