OpenAI's speech recognition model that converts any recording into text.
By Thiago Lourenço Martins
Whisper is a speech recognition model created by OpenAI and released for free as open-source software. Think of it as a stenographer who never gets tired: you hand it a recording — a hearing, a meeting, a consultation — and it returns the complete text, without you typing a single word. Audio goes in, text comes out. That's it.
A two-hour court hearing can generate over 30 pages of text if someone has to type it all manually. An expert witness who records findings by voice and then rewrites the same content. A manager who leaves a meeting without knowing what was decided because no one could write it all down.
That is how long Whisper takes to transcribe 90 minutes of good-quality audio. The model processes speech faster than real time — what would take hours of typing is ready before you finish your coffee.
Records the hearing on a phone, uploads the file to Whisper, and gets the full transcript. Instead of spending hours reconstructing what was said, they start drafting legal documents immediately — with the actual testimony in hand.
Transcribes recorded job interviews and generates a summary per candidate. Eliminates note-taking during the conversation and keeps the focus on the interviewee, not the notepad.
Dictates medical notes by voice during or after the appointment. Whisper converts them to structured text, ready to review and save. No dictaphone, no secretary, no rework.
Records field interviews on a phone. Returns to the office with an audio file and, in minutes, has the full transcript ready to turn into a story — no time wasted listening and retyping.
Records meetings and transcribes afterward. Uses the text with ChatGPT to extract decisions, owners and deadlines. The meeting summary that took an hour to write is ready in 5 minutes.
The method below uses Google Colab — a free, browser-based code notebook. No need to install anything. You don't need to know how to code: just copy and paste.
Sign in with any Google account. It is free.
An empty code cell appears on screen.
think of it as a single instruction line you will send to the computerIn the cell, paste exactly: !pip install openai-whisper — then click the triangle on the left. Wait for installation (may take 1–2 minutes).
In the left panel, click the folder icon. Then drag your audio file (.mp3, .mp4, .wav, .m4a) into the area that appears. Wait for the upload to finish.
the file disappears when you close Colab — this is normal; the results stay separatelyClick "+ Code" to create another cell. Paste the block below, replacing hearing.mp3 with the exact name of your file:
import whisper
model = whisper.load_model("medium")
result = model.transcribe("hearing.mp3", language="pt")
print(result["text"])The model loads and transcribes. For a 90-minute audio file, the process takes about 3 to 6 minutes. The full text appears just below the cell.
use "base" for faster results; use "large-v3" for maximum accuracy on technical termsAfter getting the text from Whisper, paste this request into ChatGPT — along with the transcript:
Here is the transcript of a court hearing: [paste the text].
Create a formal summary with the following sections:
1. Participants (identify by role: judge, defense attorney, opposing counsel, witness)
2. Undisputed facts — points accepted by both parties
3. Points in dispute — key disagreements
4. Decisions made — with deadline and responsible party when mentioned
Use formal legal language and organize in numbered bullet points.
You get a structured summary ready to review, adapt, and sign — without rewriting everything from scratch.
* Independent suggestion, chosen for content quality. We have no relationship or sponsorship with this channel.
Record a 1-minute voice note on your phone — read any paragraph from a document in your field. Follow the steps above with the "base" model (fastest) and compare the generated text to the original.
It worked if more than 90% of the words are correct — including technical terms from your field. If it falls short, switch to "medium" and compare again.
Every day, a new tool explained.
Receive on WhatsApp