#Whisper | Writing | Ozgur Yildiz

Apr 21, 2025

Speaker diarization: turning a transcript into 'who said what.'

Transcription gives you text. Diarization adds speaker identity. How diarization works, the tools available, and how to combine it with Whisper output.
Apr 17, 2025

Chunking audio for transcription: size, overlap, and the timing that matters.

The Whisper API has a 25MB file size limit. For long recordings, chunking is required. How to split audio correctly so transcription quality doesn't suffer at the boundaries.
Apr 14, 2025

OpenAI Whisper API: what the response actually looks like and how to use it.

A practical look at the Whisper transcription API response format, the verbose JSON mode with timestamps, and how to use the output in real applications.