Transcribe MP3 or MP4 meeting recordings via a RunPod serverless endpoint. Input: a URL to an audio/video file (e.g. from Google Cloud Storage). Output: CSV with speaker_id and utterance (speaker ...
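The request and response flow described above might look roughly like the following minimal sketch. It uses RunPod's synchronous `/runsync` route with a bearer token. The input field name `audio_url`, the response layout `output.segments`, and the per-segment keys are assumptions made for illustration. The actual handler behind the endpoint may use a different schema.

```python
import csv
import io
import json
import urllib.request

RUNPOD_BASE = "https://api.runpod.ai/v2"


def build_request(endpoint_id, api_key, audio_url):
    """Build a POST request for the endpoint's synchronous /runsync route."""
    # "audio_url" is an assumed input key; adjust to the handler's real schema.
    payload = json.dumps({"input": {"audio_url": audio_url}}).encode("utf-8")
    return urllib.request.Request(
        f"{RUNPOD_BASE}/{endpoint_id}/runsync",
        data=payload,
        headers={
            "Authorization": f"Bearer {api_key}",
            "Content-Type": "application/json",
        },
        method="POST",
    )


def segments_to_csv(segments):
    """Render a list of {"speaker_id", "utterance"} dicts as CSV text."""
    buf = io.StringIO()
    writer = csv.writer(buf, lineterminator="\n")
    writer.writerow(["speaker_id", "utterance"])
    for seg in segments:
        writer.writerow([seg["speaker_id"], seg["utterance"]])
    return buf.getvalue()


def transcribe(endpoint_id, api_key, audio_url, timeout=600):
    """Call the endpoint and return the transcript as CSV text."""
    req = build_request(endpoint_id, api_key, audio_url)
    with urllib.request.urlopen(req, timeout=timeout) as resp:
        body = json.load(resp)
    # Assumed response layout: {"output": {"segments": [...]}}.
    return segments_to_csv(body["output"]["segments"])
```

A caller would pass its own endpoint ID, API key, and file URL to `transcribe` and write the returned string to a `.csv` file. Long recordings may exceed the synchronous timeout, in which case an asynchronous job with polling is likely a better fit.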
Abstract: The “Whisper” model is a tool for anyone who needs to transcribe human speech. It is open source and offers diverse functionality. The model is capable of effectively ...