Avoiding hallucinations/repetitions when using the faster-whisper worker?
worker:
https://github.com/runpod-workers/worker-faster_whisper
Hi everyone, as the title suggests, I'm running into an issue where the transcription occasionally repeats the same word or sentence.
When this occurs, it ruins the rest of the transcription from that point on.
About 90% of the time, my input will be long audio recordings ranging from 40 to 120 minutes.
From what I've read, this seems to be a semi-common Whisper issue, but I haven't found any consistent solutions.
Some things I've tried:
- Using large-v2 instead of large-v3
- Enabling VAD
Other than that, I haven't adjusted any other parameters.
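To make the failure easier to spot in long recordings, here is a minimal sketch of how the repetition could be detected after the fact. It is not part of the worker: the function names, the `min_repeats` threshold, and the assumption that the transcription is available as a plain list of segment strings are all hypothetical, for illustration only.

```python
from typing import List, Optional


def normalize(text: str) -> str:
    """Lowercase and collapse whitespace so near-identical segments compare equal."""
    return " ".join(text.lower().split())


def first_repetition_run(segments: List[str], min_repeats: int = 3) -> Optional[int]:
    """Return the index where a run of at least `min_repeats` identical
    consecutive segments starts, or None if there is no such run.

    `segments` is assumed to be the transcription split into segment texts
    (hypothetical input shape, not the worker's documented output).
    """
    run_start = 0
    for i in range(1, len(segments) + 1):
        # Still inside the current run of identical segments: keep scanning.
        if i < len(segments) and normalize(segments[i]) == normalize(segments[run_start]):
            continue
        # The run ended at i - 1; report it if it is long enough.
        if i - run_start >= min_repeats:
            return run_start
        run_start = i
    return None


if __name__ == "__main__":
    sample = ["Hello there.", "Thank you.", "thank you.", "Thank  you.", "Goodbye."]
    print(first_repetition_run(sample))
```

The returned index marks the point from which the transcription is no longer usable, which could help decide where to cut the audio and retry with different settings.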
Any help will be greatly appreciated!
