Hi everyone, as the title suggests, I'm encountering an issue where the transcription occasionally might repeat the same word/sentence. When this occurs it ruins the entire transcription from the point where it happens. My use case 90% of the time will be large audio recordings ranging from 40 to 120 minutes.
From what I read this seems like a semi-common whisper issue but I haven't found any consistent solutions. Some things I've tried: - Using large-v2 instead of large-v3 - enabling VAD Other than that I haven't adjusted any different parameters.