If it's an SDXL model, I think you need 1024x1024 images for training.

Clip_Interrogator.py and use Clip_Interrogator Gradio Web UI with ViT-bigG-14/laion2b_s39b_b160k and blip2-2.7b with short generation times. I have almost the same GPU, so shouldn't I be able to do the same?



Clip_Interrogator.py with "best models" uses 19GB VRAM. Maybe I misunderstood?