
Portuguese Transcription: Brazilian vs European Portuguese Done Right
Two Portuguese, One Transcription Problem
Portuguese is the world's sixth most spoken language, with 260 million speakers split between two main standards. Brazilian Portuguese accounts for about 215 million of them, European Portuguese the rest. They share grammar and vocabulary but diverge in pronunciation, regional accents, and even some spelling conventions after the 2009 Acordo Ortográfico.
If you transcribe Portuguese audio with the wrong dialect setting, expect 10 to 15 points of accuracy loss. This guide shows you how to set things up correctly and what to watch for.
Brazilian vs European Portuguese: What Actually Changes
The two variants look nearly identical on paper. Spoken, they sound like different languages to untrained ears.
Pronunciation differences
- Open vowels in Brazilian Portuguese are much more open. "Café" in Brazil sounds like "kah-fay", in Portugal more like "kuh-feh".
- European Portuguese reduces or drops unstressed vowels. "Esperança" in Brazil sounds like "espe-RAN-sa", in Portugal closer to "shpRANSa".
- The letter "s" at the end of a syllable is hissed (sh-sound) in European Portuguese, soft in Brazilian Portuguese.
- Brazilian Portuguese palatalizes "t" and "d" before "i" or final "e". "Bom dia" in Brazil sounds like "bom DJI-a", in Portugal "bom DEE-a".
Vocabulary differences
- Train: trem (BR) vs comboio (PT).
- Bus: ônibus (BR) vs autocarro (PT).
- Cell phone: celular (BR) vs telemóvel (PT).
- Bathroom: banheiro (BR) vs casa de banho (PT).
Grammar drift
Brazilian Portuguese uses gerunds ("estou fazendo") where European Portuguese prefers infinitive constructions ("estou a fazer"). Mismatching variant settings makes the AI choose the wrong form.
Whisper Large-v3 Handles Both, but Set the Right Variant
By default, Portuguese transcription uses Whisper Large-v3 with auto-detection across both variants. Auto-detect lands in the right bucket about 92 percent of the time on clean audio.
If you know your variant in advance:
- Brazilian podcast or YouTube video, set the regional hint to pt-BR.
- Lisbon news interview or Portuguese-language European film, set to pt-PT.
The setting changes punctuation, capitalization, and a few orthographic choices that the 2009 reform left ambiguous (the spelling of "amazônico" vs "amazónico", for example).
Native Script: The Cedilla and the Tildes
Portuguese uses the Latin alphabet plus several diacritics:
- Acute accent (á é í ó ú) for stress.
- Circumflex (â ê ô) for stress with closed vowel.
- Grave (à) for contracted preposition + article ("à" = "a a").
- Tilde (ã õ) for nasal vowels.
- Cedilla (ç) for soft c before a, o, u.
A correct Portuguese transcript preserves every diacritic. If your tool returns "nao" instead of "não", "voce" instead of "você", or "acao" instead of "ação", switch tools. Whisper Large-v3 keeps these characters intact, which is why all Portuguese output from CATT preserves them automatically.
File Workflows for Portuguese Content
Most Portuguese audio comes from podcasts, YouTube channels, interviews, and academic recordings. Different format, same pipeline.
For MP3 podcast files, Portuguese MP3 transcription processes the file in a single pass. Speaker labels and AI summary in Portuguese are included.
For video content (Brazilian YouTubers, Portuguese news clips), use the MP4 to text tool with Portuguese selected. The audio track is extracted before transcription, so accuracy matches audio-only.
If you only need the transcript (not the AI features) on a one-off file, the free Portuguese tier gives you 60 minutes per month with everything included.
Code-Switching Between Portuguese and English
Brazilian content creators frequently slip English tech terms into Portuguese speech ("vou fazer um deploy", "tenho que dar um upgrade no servidor"). This is similar to Spanglish and handled the same way.
CATT auto-detects code-switching at the sentence boundary. Words that are fully integrated loanwords ("internet", "site", "computador") stay in Portuguese phonetic form. Pure English words inserted mid-sentence usually come back correctly spelled in English.
For aggressive in-sentence switching (common in Brazilian tech podcasts), expect a small review pass to clean up the occasional misspelled English word.
AI Summary in Portuguese, Not English
If you publish in Portuguese, your summary should be in Portuguese. Otter and most other tools generate English-only summaries regardless of audio language, which leaves Portuguese-speaking creators with a translation step before they can use the summary.
CATT's AI templates generate summaries in the audio's language. A Brazilian podcast episode yields Portuguese chapter markers, Portuguese pull quotes, and a Portuguese show description. A European Portuguese lecture yields Portuguese study notes. This is the same Whisper Large-v3 + structured output pipeline that powers English summaries, just running in your target language.
Speaker Labels for Portuguese
Diarization works on acoustic features, not language, so Portuguese speaker labels match English accuracy. Expect:
- 2-speaker interview: 92 to 95 percent attribution accuracy.
- 3-to-4 speaker podcast: 80 to 88 percent.
- Group discussion or round-table: 70 to 80 percent.
Brazilian conversational style is more overlap-prone than European Portuguese, so podcasts from São Paulo or Rio benefit from cleaner-mic recordings (one mic per speaker) more than European Portuguese interviews recorded with shared mics.
Tips for Better Portuguese Transcription
- Always set the language to Portuguese (PT) explicitly. Auto-detect is fast but not always right on short clips.
- For known variant (BR or PT), set the regional hint. Improves both transcript and AI summary quality.
- Provide proper-noun glossaries (Brazilian state names, Portuguese place names, brand names). Common AI mistakes include "Camões" (often spelled wrong), "Ceará" (vowel placement), and city names ending in "-ais" or "-aço".
- Record in a low-reverb environment. Portuguese has many unstressed vowels that compress in echo.
- If your audio is mostly slang or regional, expect to review more. AI training data leans toward standard newsroom Portuguese.
Comparing Portuguese Transcription Tools
| Feature | CATT | Otter | Sonix | Trint |
|---|---|---|---|---|
| Portuguese support | Yes (PT-BR + PT-PT) | Yes (PT-BR) | Yes (PT-BR + PT-PT) | Yes (PT-BR + PT-PT) |
| Brazilian accuracy | 94 percent | 90 percent | 91 percent | 89 percent |
| European Portuguese | 92 percent | 85 percent | 89 percent | 87 percent |
| AI summary in Portuguese | Yes | No (English only) | No | No |
| Tilde + cedilla preserved | Yes | Yes | Yes | Yes |
| Free tier | 60 min/month | 300 min/month | 30 min once | None |
| Price | $9.99/mo unlimited | $16.99/mo capped | $10/hr credit | $80/mo (7 files) |
The Portuguese-summary feature is the most overlooked differentiator. If your content stays in Portuguese, your summary should not require translation.
Start with the Free Tier
Before committing to any paid plan, try Portuguese transcription free on your own audio. Sixty minutes per month, both Brazilian and European variants, with AI summary in Portuguese included.
For longer projects, the unlimited $9.99/month plan handles every Portuguese variant. If you also work across languages, see how CATT compares to Otter.ai for multilingual content workflows.
Portuguese Podcast Producer Workflow
The Brazilian podcast scene is one of the largest in the world. Brazilian producers use CATT in this pattern:
- Record the episode (most use MP3 at 192 kbps or higher).
- Upload via Portuguese MP3 transcription.
- Enable speaker labels.
- Apply the podcast episode template for Portuguese chapter markers and show notes.
- Export Portuguese SRT for YouTube auto-captions.
For a moderately busy podcast (2 to 4 episodes weekly, 30 to 90 minutes each), the unlimited plan saves several hundred dollars per month versus per-minute services. Native Portuguese show notes save the manual translation step that English-only tools force on Brazilian creators.
Brazilian Business Use
Brazilian companies running Zoom meetings, sales calls, and webinars in Portuguese benefit from:
- Portuguese transcripts for internal documentation.
- Portuguese AI summary that meets Brazilian regulatory documentation needs.
- Speaker diarization for multi-participant calls.
- Fast turnaround on long meetings (2-hour board meeting transcribed in 4 to 6 minutes).
For Brazilian companies with international partners, the unlimited plan also covers any English content the team produces.
Try transcription free
Convert any audio or video to accurate text in seconds. Speaker labels, timestamps, and AI summaries included. No account required.
Related Articles

Arabic Transcription: MSA vs Egyptian, Gulf, Maghrebi Dialects
Transcribe Arabic audio across Modern Standard Arabic, Egyptian, Gulf, Levantine, and Maghrebi dialects. Right-to-left script, diacritics, and the best AI tool in 2026.

Chinese Mandarin Transcription: Simplified, Traditional, and Tones
Transcribe Mandarin audio with proper simplified or traditional characters. Tones, regional accents, taiwanese mandarin, and the right AI tool for Chinese in 2026.