Hello everyone. I am Dr. Furkan Gözükara, PhD Computer Engineer. SECourses is a YouTube channel dedicated to the following topics: Tech, AI, News, Science, Robotics, Singularity, ComfyUI, SwarmUI, ML, Artificial Intelligence, Humanoid Robots, Wan 2.2, FLUX, Krea, Qwen Image, VLMs, Stable Diffusion
Well, do you think it makes more sense to configure a ready-made model for text-to-image or video, or to develop a model in PyTorch? I can build basic GAN models, but it is really hard to find open-source examples of more advanced architectures that come with explanations. Do you think it would make sense to use the Stable Diffusion model and customize it?
You should consider using Ko-fi instead of Patreon, btw. That way you get 100% of the amount people pay monthly, instead of Patreon taking a huge chunk of what we pay.
Is anyone familiar with a custom node in ComfyUI that can read images from a URL? I'm looking for a way to load image batches for ControlNet, reading them from a library on AWS.
Understanding and Generating Music Intrinsically with LLM
While Large Language Models (LLMs) demonstrate impressive capabilities in text generation, we find that their ability has yet to be generalized to music, humanity's creative language. We introduce…