Hello everyone. I am Dr. Furkan Gözükara. PhD Computer Engineer. SECourses is a dedicated YouTube channel for the following topics : Tech, AI, News, Science, Robotics, Singularity, ComfyUI, SwarmUI, ML, Artificial Intelligence, Humanoid Robots, Wan 2.2, FLUX, Krea, Qwen Image, VLMs, Stable Diffusion
my process i learned on reddit is to make two data sets of same images, 512x512 and 768x768, run each set seperate lora traing, then take the best of each and merge lora 100% and they are really nice. turn down weight to 0.4 and its amazing, you ever try that method?
I think that because I can only apply float8 to unet and in cpu offload, unet is already the only one that remains in the cuda, so nothing changes if the memory consumption is lower than the total available of the video card.