With Gradio, using Wan 2.1 T2V 480 I can generate a video.
But using Wan I2V 480 16B the GUI send back an error (Connection errored out.), and the prompt gives "press any button" . What can be?
Log:
To create a public link, set share=True in launch().
WAN 2.1 1.3B (Text/Video-to-Video)
WAN 2.1 14B Image-to-Video 480P
[CMD] No LoRA selected. Using base model.
[CMD] Loading model: 14B_image_480p with torch dtype: torch.bfloat16 and num_persistent_param_in_dit: 1200000000
Loading models from: ['models/Wan-AI/Wan2.1-I2V-14B-480P/diffusion_pytorch_model-00001-of-00007.safetensors', 'models/Wan-AI/Wan2.1-I2V-14B-480P/diffusion_pytorch_model-00002-of-00007.safetensors', 'models/Wan-AI/Wan2.1-I2V-14B-480P/diffusion_pytorch_model-00003-of-00007.safetensors', 'models/Wan-AI/Wan2.1-I2V-14B-480P/diffusion_pytorch_model-00004-of-00007.safetensors', 'models/Wan-AI/Wan2.1-I2V-14B-480P/diffusion_pytorch_model-00005-of-00007.safetensors', 'models/Wan-AI/Wan2.1-I2V-14B-480P/diffusion_pytorch_model-00006-of-00007.safetensors', 'models/Wan-AI/Wan2.1-I2V-14B-480P/diffusion_pytorch_model-00007-of-00007.safetensors']
model_name: wan_video_dit model_class: WanModel
This model is initialized with extra kwargs: {'model_type': 'i2v', 'patch_size': (1, 2, 2), 'text_len': 512, 'in_dim': 36, 'dim': 5120, 'ffn_dim': 13824, 'freq_dim': 256, 'text_dim': 4096, 'out_dim': 16, 'num_heads': 40, 'num_layers': 40, 'window_size': (-1, -1), 'qk_norm': True, 'cross_attn_norm': True, 'eps': 1e-06}
Press any key to continue . . .To create a public link, set share=True in launch().
WAN 2.1 1.3B (Text/Video-to-Video)
WAN 2.1 14B Image-to-Video 480P
[CMD] No LoRA selected. Using base model.
[CMD] Loading model: 14B_image_480p with torch dtype: torch.bfloat16 and num_persistent_param_in_dit: 1200000000
Loading models from: ['models/Wan-AI/Wan2.1-I2V-14B-480P/diffusion_pytorch_model-00001-of-00007.safetensors', 'models/Wan-AI/Wan2.1-I2V-14B-480P/diffusion_pytorch_model-00002-of-00007.safetensors', 'models/Wan-AI/Wan2.1-I2V-14B-480P/diffusion_pytorch_model-00003-of-00007.safetensors', 'models/Wan-AI/Wan2.1-I2V-14B-480P/diffusion_pytorch_model-00004-of-00007.safetensors', 'models/Wan-AI/Wan2.1-I2V-14B-480P/diffusion_pytorch_model-00005-of-00007.safetensors', 'models/Wan-AI/Wan2.1-I2V-14B-480P/diffusion_pytorch_model-00006-of-00007.safetensors', 'models/Wan-AI/Wan2.1-I2V-14B-480P/diffusion_pytorch_model-00007-of-00007.safetensors']
model_name: wan_video_dit model_class: WanModel
This model is initialized with extra kwargs: {'model_type': 'i2v', 'patch_size': (1, 2, 2), 'text_len': 512, 'in_dim': 36, 'dim': 5120, 'ffn_dim': 13824, 'freq_dim': 256, 'text_dim': 4096, 'out_dim': 16, 'num_heads': 40, 'num_layers': 40, 'window_size': (-1, -1), 'qk_norm': True, 'cross_attn_norm': True, 'eps': 1e-06}
Press any key to continue . . .