Hello everyone. I am Dr. Furkan Gözükara. PhD Computer Engineer. SECourses is a dedicated YouTube channel for the following topics : Tech, AI, News, Science, Robotics, Singularity, ComfyUI, SwarmUI, ML, Artificial Intelligence, Humanoid Robots, Wan 2.2, FLUX, Krea, Qwen Image, VLMs, Stable Diffusion
Found this on the internet: "suggests that your inputs are on the GPU, but your model is not. So I’d double check whether your model is effectively on the GPU."
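In case it helps, here is a minimal sketch of what that check could look like. It assumes the code is PyTorch (the quote does not name the framework), and the `nn.Linear` model and random input are stand-ins for illustration, not anything from the actual pipeline.

```python
import torch
from torch import nn

# Pick the GPU if one is available, otherwise fall back to the CPU.
device = torch.device("cuda" if torch.cuda.is_available() else "cpu")

# Placeholder model and input, for illustration only.
model = nn.Linear(4, 2)
inputs = torch.randn(3, 4)

# Move BOTH the model and the inputs to the same device.
model = model.to(device)
inputs = inputs.to(device)

# Check where the model's weights actually live before running it.
model_device = next(model.parameters()).device
print(f"model on {model_device}, inputs on {inputs.device}")

outputs = model(inputs)
```

If the two devices printed differ, the forward pass is where a device mismatch error would normally show up.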
Hello @Furkan Gözükara SECourses, which local text-to-speech model do you recommend for longform content that is comparable to ElevenLabs (without the price tag)? Thank you
Hey all, having great fun with SUPIR V22 right now. Once in a while (haven't figured out why) I get an error with the face restoration. I thought maybe it's something to do with resolution, but it will break on a 1920x1080 image and on others too. Here are the errors, any help is appreciated!
I would use https://github.com/jhj0517/Whisper-WebUI, which comes with OpenAI's Whisper built in. Take your 2 hour video, transcode it to mp3, then run it through. It'll take a while but it'll be accurate!
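For the transcode step, a rough sketch assuming you have `ffmpeg` installed and on your PATH (the message above does not say which tool to use, so that part is my assumption). The helper names `build_ffmpeg_command` and `transcode_to_mp3` are made up for this example, and the quality setting is just a reasonable guess for speech.

```python
import shutil
import subprocess
from pathlib import Path


def build_ffmpeg_command(video_path: str, mp3_path: str) -> list[str]:
    """Build an ffmpeg command that extracts the audio track as mp3."""
    return [
        "ffmpeg",
        "-i", video_path,      # input video
        "-vn",                 # drop the video stream
        "-c:a", "libmp3lame",  # encode audio with the LAME mp3 encoder
        "-q:a", "4",           # VBR quality level (assumed value)
        mp3_path,
    ]


def transcode_to_mp3(video_path: str) -> Path:
    """Run ffmpeg and return the path of the mp3 written next to the video."""
    if shutil.which("ffmpeg") is None:
        raise RuntimeError("ffmpeg not found on PATH")
    mp3_path = Path(video_path).with_suffix(".mp3")
    subprocess.run(build_ffmpeg_command(video_path, str(mp3_path)), check=True)
    return mp3_path
```

Calling `transcode_to_mp3("my_video.mp4")` should leave `my_video.mp3` in the same folder, which you can then load in the Whisper-WebUI interface.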