Software Engineering Courses (SECourses)•15mo ago

oh so he wouldn't have knowledge about training? I have a couple of questions about Flux style train

oh so he wouldn't have knowledge about training? I have a couple of questions about Flux style training can i dm you?

deepblue_1337•10/16/24, 1:01 PM

normal (not batch size 7) works

Mmagical oh so he wouldn't have knowledge about training? I have a couple of questions ab...

Furkan Gözükara SECourses•10/16/24, 2:28 PM

i have full flux style training guide did you read?

Furkan Gözükara SECourses•10/16/24, 2:28 PM

https://huggingface.co/MonsterMMORPG/3D-Cartoon-Style-FLUX

MonsterMMORPG/3D-Cartoon-Style-FLUX · Hugging Face

Ddeepblue_1337 normal (not batch size 7) works

Furkan Gözükara SECourses•10/16/24, 2:28 PM

ye looks like 48 gb not sufficient

Furkan Gözükara SECourses•10/16/24, 2:28 PM

for batch size 7

Furkan Gözükara SECourses•10/16/24, 2:30 PM

wow i am able to open multiple thinlinc client on windows 11

Ddeepblue_1337 normal (not batch size 7) works

Furkan Gözükara SECourses•10/16/24, 2:45 PM

single A6000 batch size 7 works

Furkan Gözükara SECourses•10/16/24, 2:45 PM

fine tuning

Furkan Gözükara SECourses•10/16/24, 2:45 PM

dsienra•10/16/24, 3:14 PM

@Dr. Furkan Gözükara Update on https://huggingface.co/nyanko7/flux-dev-de-distill New training, catastrophic failure with Apply T5 Attention Mask, and Train T5-XXL enabled. the lora bleeds and is no learning the concepts, the prompt "token class" renders random things like a dog or something on regular flux-dev, on inference using flux-dev-de-distill kinda works sometimes, one image have resembles the next bleeds, all over the place. I going to disable Apply T5 Attention Mask, and Train T5-XXL, one of this options is breaking the model. all tests images have horizontal lines also.

nyanko7/flux-dev-de-distill · Hugging Face

dsienra•10/16/24, 3:20 PM

@Dr. Furkan Gözükara may be the problem wasn't the learning rate, may be is one or both of those Apply T5 Attention Mask, and Train T5-XXL, what do you think?

FFurkan Gözükara SECourses i have full flux style training guide did you read?

magicalOP•10/16/24, 3:40 PM

I did. Is there much quality difference between using Rank 3 config vs 4x_GPU_Rank_1_SLOW?

magicalOP•10/16/24, 3:41 PM

If not the only think I didn't try is training without captions

Ddsienra @Dr. Furkan Gözükara may be the problem wasn't the learning rate, may be is one ...

Furkan Gözükara SECourses•10/16/24, 3:48 PM

i think it could be all so every case has to be tested

Mmagical I did. Is there much quality difference between using Rank 3 config vs 4x_GPU_Ra...

Furkan Gözükara SECourses•10/16/24, 3:48 PM

yes small diff

FFurkan Gözükara SECourses yes small diff

magicalOP•10/16/24, 3:50 PM

oh and I didn't try training T5 XXL. If there is no text in my training images would it still improve the training?

dsienra•10/16/24, 4:06 PM

After my failure I'm not so sure about "Apply T5 Attention Mask, and Train T5-XXL enabled" you are training the T5, but in inference you load a T5 model that doesn't know your token. forge o SwarmUI load the regular T5 model that was not trained. so I'm not so sure

Ddsienra After my failure I'm not so sure about "Apply T5 Attention Mask, and Train T5-XX...

magicalOP•10/16/24, 4:18 PM

oh the t5 model should be trained as well for it to work properly?

dsienra•10/16/24, 4:18 PM

@Dr. Furkan Gözükara may be Apply T5 Attention Mask can help, but I don't understand how can Train T5-XXL helps, because on inference you load a T5 model that was not trained . forge or SwarmUI load the regular T5 model that was not trained

Mmagical oh the t5 model should be trained as well for it to work properly?

dsienra•10/16/24, 4:23 PM

Is the thing I don't understand, so really IDK now I'm training again with both options disabled, My question can apply to everyone, but in my case is different because I'm training flux-dev-de-distill so in my case is broking something. I'm having artifacts on the images with this on and also is not learning the concepts correctly

dsienra•10/16/24, 7:04 PM

@Dr. Furkan Gözükara Update on https://huggingface.co/nyanko7/flux-dev-de-distill I started new training - "T5 Attention Mask and T5-XXL both disabled" same lr 30 epochs now. I tested the lora checkpoint and all problems are fixed, is going great using regular flux-dev-fp8 for inference, I'm training tree people same class, same prompt changing just the name renders the correct subject, is undertrained but goes very well so far

nyanko7/flux-dev-de-distill · Hugging Face

ar.1•10/16/24, 9:42 PM

ar.1•10/16/24, 9:42 PM

did i do something wrong? why is there 17.6 million steps ??

Mmagical oh and I didn't try training T5 XXL. If there is no text in my training images w...

Furkan Gözükara SECourses•10/16/24, 9:42 PM

T5 XXL made minor diff that i didnt notice :d

Aar.1 Click to see attachment

Furkan Gözükara SECourses•10/16/24, 9:43 PM

yep you did indeed

Furkan Gözükara SECourses•10/16/24, 9:43 PM

sub folder

Furkan Gözükara SECourses•10/16/24, 9:43 PM

more than 1 repeat

FFurkan Gözükara SECourses more than 1 repeat

ar.1•10/16/24, 9:45 PM

ah it was at 40

Aar.1 ah it was at 40

Furkan Gözükara SECourses•10/16/24, 9:45 PM

yep

FFurkan Gözükara SECourses yep

ar.1•10/16/24, 9:46 PM

ar.1•10/16/24, 9:46 PM

is that too many still

ar.1•10/16/24, 9:46 PM

2206 images

Aar.1 Click to see attachment

Furkan Gözükara SECourses•10/16/24, 9:46 PM

sub folders

Vigilence•10/16/24, 9:47 PM

The config could use an update as he made the same mistake as me.

VVigilence The config could use an update as he made the same mistake as me.

Furkan Gözükara SECourses•10/16/24, 9:48 PM

i tested config today works as expected

Vigilence•10/16/24, 9:48 PM

The repeat was at 40 when I used it, is that not saved in the config?

VVigilence The repeat was at 40 when I used it, is that not saved in the config?

Furkan Gözükara SECourses•10/16/24, 9:49 PM

nope

Furkan Gözükara SECourses•10/16/24, 9:49 PM

it is read from folder

Furkan Gözükara SECourses•10/16/24, 9:49 PM

you need to fix folder name

Furkan Gözükara SECourses•10/16/24, 9:49 PM

i explained this in details in tutorial

Furkan Gözükara SECourses•10/16/24, 9:49 PM

but you are skipping :d

Vigilence•10/16/24, 9:49 PM

I see. Lol.

Furkan Gözükara SECourses•10/16/24, 9:50 PM

Dazzastrous [4090]•10/16/24, 9:50 PM

need to be more vigilent?

Dazzastrous [4090]•10/16/24, 9:51 PM

had to

Dazzastrous [4090]•10/16/24, 9:51 PM

lol

Vigilence•10/16/24, 9:51 PM

So here is what I suggest, as I watched some parts of all your tutorials, but not all (and sometimes the bookmarks are wrong). Add an image in the config file showing the repeat section and saying it needs to be 1.

Vigilence•10/16/24, 9:51 PM

Should be labeled (Step 1: Read First!) or something similar.

Dazzastrous [4090]•10/16/24, 9:52 PM

yeh not the easiest to navigate through

Dazzastrous [4090]•10/16/24, 9:52 PM

so many videos

oh so he wouldn't have knowledge about training? I have a couple of questions about Flux style train

Similar Threads