@Furkan Gözükara SECourses do you have experience with training multiple models at a time with Kohya on multi-GPU systems? I read what you wrote about the bug that causes excess memory utilization when finetuning on multiple GPUs, so training speed doesn't scale linearly with the number of GPUs.

I've been lucky enough to get a lot of credits from the great guys at TensorDock, so I could try this on multi-H100 SXM systems. I can confirm the bug, but I can also tell you that you can run two instances of Kohya at the same time, and in that case you do get double the training speed.
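For anyone who wants to try the same thing, here's a minimal sketch of the two-instance setup. It assumes kohya-ss sd-scripts launched via `accelerate launch`; the config file names are hypothetical placeholders. The key idea is pinning each instance to its own GPU with `CUDA_VISIBLE_DEVICES`, so the two runs never touch the same device:

```shell
# Sketch only: config file names are placeholders for your own training configs.

# First training run, pinned to GPU 0.
CUDA_VISIBLE_DEVICES=0 accelerate launch --num_processes=1 train_network.py \
    --config_file model_a.toml > train_a.log 2>&1 &

# Second training run, pinned to GPU 1.
CUDA_VISIBLE_DEVICES=1 accelerate launch --num_processes=1 train_network.py \
    --config_file model_b.toml > train_b.log 2>&1 &

wait  # block until both background runs finish
```

Since each process only sees one GPU, neither run hits the multi-GPU code path where the memory bug shows up.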

Of course, this is only useful if you have more than one model to train in the first place.