Just wanted to share an update: I've finished my new Qwen-Image LoRA training, and I'm really happy with the results! It's a huge improvement over my first attempt.
I think the new advanced features in Musubi-Tuner made a massive difference. For this run, I used:

- LoRA+, which trains the LoRA "B" matrices at a higher learning rate than the "A" matrices to make training more efficient (see the sketch right after this list).
- PyTorch Dynamo (inductor backend) to compile and optimize the model on the fly in WSL.
- The qinglong_qwen timestep sampler for a more intelligent spread of training timesteps.
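For anyone wondering what LoRA+ actually changes: it trains the LoRA "B"/up matrices at a learning rate that's a fixed multiple of the base rate, which is where the efficiency gain comes from. Here's a minimal PyTorch sketch of the idea; the helper name, the name matching, and the ratio of 4 are my own illustrative choices, not Musubi-Tuner's internals (there it's configured through --network_args, with a loraplus_lr_ratio setting if I remember right).

```python
import torch.nn as nn

def build_loraplus_param_groups(model: nn.Module, base_lr: float = 1e-4,
                                loraplus_ratio: float = 4.0):
    """LoRA+ in one sentence: train the LoRA "B" (up) matrices at
    base_lr * loraplus_ratio and everything else at base_lr."""
    fast, slow = [], []
    for name, p in model.named_parameters():
        if not p.requires_grad:
            continue
        # Name matching is illustrative; real implementations key off the
        # module structure ("lora_up" in sd-scripts-style networks,
        # "lora_B" in PEFT-style ones).
        (fast if ("lora_up" in name or "lora_B" in name) else slow).append(p)
    return [
        {"params": slow, "lr": base_lr},
        {"params": fast, "lr": base_lr * loraplus_ratio},
    ]

# Usage: optimizer = torch.optim.AdamW(build_loraplus_param_groups(network))
```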
After the training was complete, I picked my top 3 favorite checkpoints. Then, instead of just choosing one, I merged them into a single final model using the lora_post_hoc_ema.py script, which averages the weights of the best LoRAs to produce a more stable, higher-quality final version.
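I haven't gone through lora_post_hoc_ema.py line by line, so treat this as a sketch of the idea rather than the script itself: merging N checkpoints boils down to a weighted average of their tensors. The filenames and the flat weighting below are placeholders; the post-hoc EMA technique proper (from Karras et al.'s diffusion-training paper) fits a power-function weighting that favors later checkpoints instead of a plain average.

```python
import torch
from safetensors.torch import load_file, save_file

# Hypothetical filenames: substitute your own top-3 checkpoints.
paths = [
    "qwen_lora-000012.safetensors",
    "qwen_lora-000014.safetensors",
    "qwen_lora-000016.safetensors",
]

# Equal weights give a plain average (the simplest case).
weights = [1.0 / len(paths)] * len(paths)

merged: dict[str, torch.Tensor] = {}
dtypes: dict[str, torch.dtype] = {}
for path, w in zip(paths, weights):
    for key, t in load_file(path).items():
        dtypes[key] = t.dtype
        merged[key] = merged.get(key, 0.0) + w * t.float()  # accumulate in fp32

# Cast each tensor back to its original dtype and save the merged LoRA.
save_file({k: v.to(dtypes[k]) for k, v in merged.items()},
          "qwen_lora_merged.safetensors")
```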
There's still some work to do on finding the best sampler and scheduler for inference, but I'm very pleased with the outcome so far. Training took a bit longer than expected, around 4 hours on my RTX 4090, most likely because PyTorch Dynamo's initial compilation phase needs time to warm up before the speedup pays off.
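For context on that warm-up: torch.compile with the inductor backend traces the model and generates fused kernels the first time the compiled module is called, so the first steps of a run are slow and everything after reuses the cached graph. A tiny self-contained illustration (the toy module is a stand-in, not the Qwen-Image DiT):

```python
import torch
import torch.nn as nn

# Toy stand-in for the real model; only the compile call matters here.
model = nn.Sequential(nn.Linear(64, 64), nn.GELU(), nn.Linear(64, 64))

# Inductor generates fused kernels on the first call, so step one is
# slow and every later call reuses the compiled graph.
compiled = torch.compile(model, backend="inductor")

x = torch.randn(8, 64)
_ = compiled(x)  # slow: kernels are compiled here (the warm-up)
_ = compiled(x)  # fast: reuses the compiled graph
```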
Here are a few example images generated with the final merged LoRA + 8stepLora (the 8-step lightning LoRA).