Hello everyone. I am Dr. Furkan Gözükara. PhD Computer Engineer. SECourses is a dedicated YouTube channel for the following topics : Tech, AI, News, Science, Robotics, Singularity, ComfyUI, SwarmUI, ML, Artificial Intelligence, Humanoid Robots, Wan 2.2, FLUX, Krea, Qwen Image, VLMs, Stable Diffusion
The captions is easily created by CLIP interrogator and some manual labor. Question is what kind of dataset would be needed :/ All training videos (Including your fantastic videos) are of subject/object training or an art style. Not of a design