#Whisper
3 posts tagged with this topic.
-
Speaker diarization: turning a transcript into 'who said what.'
Transcription gives you text. Diarization adds speaker identity. How diarization works, the tools available, and how to combine it with Whisper output.
-
Chunking audio for transcription: size, overlap, and the timing that matters.
The Whisper API has a 25 MB file size limit, so long recordings have to be chunked. How to split audio correctly so transcription quality doesn't suffer at the boundaries.
-
OpenAI Whisper API: what the response actually looks like and how to use it.
A practical look at the Whisper transcription API response format, the verbose JSON mode with timestamps, and how to use the output in real applications.