Hello everyone. I am Dr. Furkan Gözükara, PhD in Computer Engineering. SECourses is a YouTube channel dedicated to the following topics: Tech, AI, News, Science, Robotics, Singularity, ComfyUI, SwarmUI, ML, Artificial Intelligence, Humanoid Robots, Wan 2.2, FLUX, Krea, Qwen Image, VLMs, Stable Diffusion
I’m planning to build a SaaS product based on some ComfyUI workflows I’ve created, using Flux as the main image generation model. I’ve spoken with the team at Fal.ai, and they confirmed I can use Flux commercially through their platform, as long as it runs through their API, due to their arrangement with Black Forest Labs. They also mentioned they can host ComfyUI, and that it’s possible to use ComfyUI as the backend for a SaaS product, as long as all models used in the workflow (upscalers, facial detection, etc.) are licensed for commercial use. I’m still deciding whether to stick with ComfyUI or build a custom pipeline based on my workflows.
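For reference, the API-only route could look something like the sketch below. It assumes fal.ai's Python client (fal_client) and their public FLUX.1 [dev] endpoint; a hosted ComfyUI workflow would expose its own endpoint id and input schema, so treat the specifics as illustrative.

    # A minimal sketch, assuming fal.ai's Python client (pip install fal-client)
    # and the public FLUX.1 [dev] endpoint; a hosted ComfyUI workflow would
    # have its own endpoint id and argument names.
    import fal_client

    result = fal_client.subscribe(
        "fal-ai/flux/dev",  # endpoint id; a custom hosted workflow would differ
        arguments={
            "prompt": "a product photo of a ceramic mug on a wooden table",
            "num_inference_steps": 28,  # typical FLUX.1 [dev] step count
        },
    )
    print(result["images"][0]["url"])  # the generated image comes back as a URL

The point of this setup is that the Flux weights never run on your own infrastructure: the commercial-use permission lives in Fal.ai's arrangement with Black Forest Labs, which is why everything has to go through their API.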
Hey Dr. Furkan, my friend is training a Flux character LoRA (not realistic) with the default settings of ai-toolkit, while I'm training with kohya and your rank 3 config. He seems to get really good results, but I'm having trouble getting the proper body shape and some details right. No matter what parameters I try, it feels undertrained. I tried:
- 1 repeat with 200 epochs (20 images), LR 0.00005
- 20 repeats with 10 epochs (20 images), LR 0.0001
- 150 repeats with 1 epoch, saving every 1000 steps, then tested the outputs at 2000, 3000, and 4000 steps
Unless I make the prompts very long and detailed, I get something that resembles the character and clothes, but the body shape and face come out mostly random on every seed and never really capture the character. Even when I put a lot of detail into the prompt, it kind of works, but not really.
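As a side note on those three runs, here is a quick back-of-the-envelope check. It is only a sketch: it assumes batch size 1 and kohya's usual steps-per-epoch of images x repeats, and the learning rate of the third run was not stated above, so it is marked as assumed. All three land in roughly the same total-step range, which might be why they feel equally undertrained.

    # Rough total-step comparison of the three kohya runs above.
    # Assumes batch size 1; steps per epoch = images * repeats.
    configs = [
        {"repeats": 1,   "epochs": 200, "images": 20, "lr": 5e-5},
        {"repeats": 20,  "epochs": 10,  "images": 20, "lr": 1e-4},
        {"repeats": 150, "epochs": 1,   "images": 20, "lr": 1e-4},  # lr assumed, not stated
    ]
    for c in configs:
        total = c["images"] * c["repeats"] * c["epochs"]
        print(f'{c["repeats"]} repeats x {c["epochs"]} epochs -> {total} steps at lr {c["lr"]}')
    # 1   x 200 -> 4000 steps
    # 20  x 10  -> 4000 steps
    # 150 x 1   -> 3000 steps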
And yet my friend's first attempt with the default settings of ai-toolkit, which exposes far fewer parameters to change, gets awesome results and accuracy, and it's still very flexible while keeping the essence of the character pretty well.
Forgot to mention: we trained the same character. His captions are mostly automated, with rough manual fixes; mine were automated at first, then heavily reworked by hand into a consistent captioning structure.
Right now I'm trying to replicate his decent-plus ai-toolkit result with kohya_ss, just to have a solid starting point, but I can't manage to achieve that.
And even though the character isn't accurate enough yet, I do find that my consistent captioning structure works well for the angle the character is standing at and for non-neutral facial expressions: both respond when I prompt for them. So should I keep captioning those?
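To make "consistent captioning structure" concrete, here is a hypothetical template of my own (the trigger token and slot order are made up for illustration, not a known standard): one fixed slot order, so the trainer sees the angle and expression described the same way across all 20 images.

    # Hypothetical caption template: fixed slot order across the dataset.
    # "ohwxchar" is a made-up trigger token for illustration.
    def build_caption(trigger, angle, expression, clothing, extras=""):
        parts = [trigger, f"{angle} view", f"{expression} expression", clothing]
        if extras:
            parts.append(extras)
        return ", ".join(parts)

    print(build_caption("ohwxchar", "three-quarter", "smiling", "red hooded cloak"))
    # -> "ohwxchar, three-quarter view, smiling expression, red hooded cloak"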