Hello everyone. I am Dr. Furkan Gözükara. PhD Computer Engineer. SECourses is a dedicated YouTube channel for the following topics : Tech, AI, News, Science, Robotics, Singularity, ComfyUI, SwarmUI, ML, Artificial Intelligence, Humanoid Robots, Wan 2.2, FLUX, Krea, Qwen Image, VLMs, Stable Diffusion
MIDI is a novel paradigm for compositional 3D scene generation from a single image, extending pre-trained 3D object generation models to multi-instance diffusion models for simultaneous generation of multiple 3D instances.
@Furkan Gözükara SECourses Are you seeing the full 32GB in windows from your 5090? Mine only reports 31.5GB and its the difference between fitting the full Wan Model or not.. My 3090 reports 24.0 - in the same task manager so I'm thinking its not the 1024 calculation.
Yeah something different between 30xx/50xx. My 3090 = 24.0 in task manager, 5090 = 31.5 in task manager - and I can't load the full FP16 WAN 2.1 model in 32GB because its 500MB short!! it goes to shared GPU which slows it down a lot.
@Furkan Gözükara SECourses have you found perfect teacache settings for Wan2.1? with no visible quality loss to me: teacache at 0.250 T2V 14B model, with Sage and torch-acceleration i got a 49 frame Wan down to 101 seconds. I can get it faster, but start to notice degradation.
Will you be doing wan/hunyuan lora training tests? I got hunyuan down well, but wan lora's are not coming out well for me with musubi-tuner, but can't tell if its 5090 problem. So much instability with cuda 12.8, sage, triton 3.3