Hello everyone. I am Dr. Furkan Gözükara. PhD Computer Engineer. SECourses is a dedicated YouTube channel for the following topics : Tech, AI, News, Science, Robotics, Singularity, ComfyUI, SwarmUI, ML, Artificial Intelligence, Humanoid Robots, Wan 2.2, FLUX, Krea, Qwen Image, VLMs, Stable Diffusion
so even with 5 images 1024/1024 fails, but 768/768 works with any number of images on 8GB torch 2.5 . I even changed OS to windows 11 to try to figure out.. maybe its a memory leak in torch 2.5 and older nvidia.
FYI precompiled pytorch - does not require cuda installed on windows. it comes built in.. I have completely removed VS2022 and cuda from my system during troubleshooting
i keep trying to install kohya with flux per instructins but after i copy paste the first step, it ends with this error "Copying accelerate config file to: /root/.cache/huggingface/accelerate/default_config.yaml Traceback (most recent call last): File "/workspace/kohya_ss/kohya_gui.py", line 6, in <module> import gradio as gr ModuleNotFoundError: No module named 'gradio' root@c9368cea187b:/workspace# "
I'm trying to train a lora/dreambooth to make simple icon type images of a particular style, and having watched the tutorials and looked at the sample outputs, I'm not sure if this project is better suited to dreambooth or lora training. The attached image is an example of one of the training images.
I keep seeing people saying that it's best to do no captioning when training a style, but it seems odd to me -- would I not want to give it at least a minimal caption, for example "[triggerword], cow, bat wings"
The only way I've gotten it to work close to the way I want it to is using the flux gym on default settings, but I would prefer to use Kohya -- based on the preferred results from flux gym, should I increase the number of repeats (from 1) and decrease the number of epochs (from 200)