Hi guys! I've been training FLUX LoRAs with Kohya on Windows across 4 RTX GPUs. Until about a month ago everything worked fine, but now training keeps crashing in torch.distributed.get_world_size() when launched through Accelerate.
Has something changed recently in PyTorch or Accelerate that breaks multi-GPU training on Windows? Is there any recommended workaround for this?
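In case it helps narrow things down, here is a minimal sketch of the guard I'm considering patching into the training script, on the assumption (my guess, not confirmed) that the crash comes from get_world_size() being called before the default process group is initialized:

```python
import torch.distributed as dist

def safe_world_size() -> int:
    """Return the distributed world size, or 1 outside a process group.

    torch.distributed.get_world_size() raises a RuntimeError if the
    default process group has not been initialized, so guard the call.
    """
    if dist.is_available() and dist.is_initialized():
        return dist.get_world_size()
    return 1  # fall back to single-process behavior
```

Worth noting that NCCL is not available on Windows, so PyTorch distributed there is limited to the gloo backend; I'm wondering if a recent PyTorch or Accelerate update changed how that backend gets initialized on Windows and that's what broke multi-GPU launches.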