Hello everyone. I am Dr. Furkan Gözükara. PhD Computer Engineer. SECourses is a dedicated YouTube channel for the following topics : Tech, AI, News, Science, Robotics, Singularity, ComfyUI, SwarmUI, ML, Artificial Intelligence, Humanoid Robots, Wan 2.2, FLUX, Krea, Qwen Image, VLMs, Stable Diffusion
you need lora only when you want to publish on civitai so that people can use on different base models - otherwise lora is no, it is inferior from all aspects. also you would want lora if you are service so you want to host small files
Hello everyone! I trained my character, Conan, through finetuning, and now I want to add a LoRA with a film style. However, when working with LoRA from CIVITAI in the style of a specific film genre (e.g., Dark Fantasy movies from the 80s) or a type of lens (e.g., Anamorphic Lens), I encountered an issue: at high LoRA values, it radically changes the character's clothing and appearance or removes objects from the environment, but adheres to the style better. At lower values (around 0.5 or less), it either doesn't work or leaves some noise.
I understand that the authors of these LoRA models often use 30-50 images for style training, which might be the main limitation in the flexibility of LoRA models.
I could create a dataset of 100-500-1000 images for the Dark Fantasy movies 80s style. But tell me, should I train it on my already finetuned model with my character (will it affect his appearance too much?), or should I create a new LoRA with a larger dataset of images (100-300-500) for higher-quality generations? What do you think will work better?
If I want a lora to impact only my characters body, should I crop to only include body in training images my current solution is inpaintinf my face with segment as shown, but face does change when using the lora
Alright, then what are the settings for training through DreamBooth? As I understand it, your configs are more optimized and suitable for memorizing characters? Since for style, a more flexible mode of memorization is required, as well as detailed captions for greater flexibility?
And is it possible, for example, to use widescreen images for training so that it works better at this resolution? Moreover, this will simplify the process of preparing the dataset.
@NicB@SpecialHelper @Furkan Gözükara SECourses Hey guys I am currently following Dr. Furkans tutorial on how to do a Full File Tuning on Massed Compute and in the picture you can see what I want to deploy for the process.
1. But I dont know it the training will be faster if I use 2x A6000 with 96gb vram and more space. 2. I dont know hom much money I should put on the account for these trainings:
- 1x for me, 200 png dataset, various poses, expressions and clothing - 1x for my Brother, 100 png dataset, various poses, expressions and clothing (I want max quality possible)
And what parameters would you guys suggest me to use?
And I also don't know how to put money on the massed compute account and I have also connected my credit card
And also another thing that I would like to know is:
Can I close my Browser or even shut down my PC while the training is running on massed Compute? Because I saw that the the best training for about 250 images dataset will take about 30-31 hours per training
Just don't shutdown the VM instance in massed compute itself. It is like a separate computer for yours so everything will run as long as you don't shut it down