Hello everyone. I am Dr. Furkan Gözükara. PhD Computer Engineer. SECourses is a dedicated YouTube channel for the following topics : Tech, AI, News, Science, Robotics, Singularity, ComfyUI, SwarmUI, ML, Artificial Intelligence, Humanoid Robots, Wan 2.2, FLUX, Krea, Qwen Image, VLMs, Stable Diffusion
Hey, in full stable diff video tutorial I've found that I have learning rate set to 8e-06 with your presets, whereas on video you have 1e-05. However I've used fast preset as I have 24 of VRAM. Is it the case?
btw. what is resolution of your posture pictures you use for training? I've noticed that man dataset has 1024x1024, so are your photos also on 1024x1024? Also should be these as jpg or can be as png? Note that png is 10 times bigger.
another question, in your concept dataset you set prompt source as single file pointing to ohwx man.txt, but you have captions for each of the file, is the onetrainer looks at this file and also checks other txt files which coresponds to jpg files?