
AWS Transcribe Pricing Per Minute in 2026 (Standard, Medical, Call Analytics)
What Does AWS Transcribe Actually Cost?
AWS Transcribe is Amazon's cloud-based speech-to-text service, and its pricing model follows the familiar AWS pattern: pay-per-use with no upfront commitments. The base rate is $0.024 per minute for standard batch transcription, but the final number on your invoice depends heavily on which tier you use, whether you need real-time streaming, and how much audio you process each month.
For teams already embedded in the AWS ecosystem, Transcribe can be a natural extension of existing infrastructure. For everyone else, the pricing deserves careful scrutiny. This guide walks through every tier, surcharge, and volume scenario so you can model your actual costs before committing.
If you are evaluating multiple speech-to-text APIs side by side, our comprehensive STT pricing comparison covers all major providers in one place.
AWS Transcribe Pricing Breakdown
AWS Transcribe is not a single product with a single price. It ships as three distinct tiers, each targeting a different workload, plus optional add-ons that increase the per-minute cost. Understanding the tier structure is the first step to accurate budgeting.
Tier-by-Tier Pricing Table
| Tier | Batch (per minute) | Streaming (per minute) | Primary Use Case |
|---|---|---|---|
| Standard | $0.024 | $0.0288 | General-purpose transcription |
| Medical | $0.0480 | $0.0504 | HIPAA-compliant clinical documentation |
| Call Analytics | $0.0480 | $0.0480 | Contact center post-call and real-time analysis |
| Toxicity Detection | +$0.012 (add-on) | +$0.012 (add-on) | Content moderation overlay |
Usage is billed in one-second increments, with each job rounded up to the nearest second. AWS quotes per-minute rates in its documentation, but this billing granularity is finer than that of most competitors, so you do not overpay for partial minutes.
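As a rough illustration of per-second billing, the sketch below prices a single job from its duration. The rates are copied from the table above. The function name `job_cost` and its parameters are hypothetical, and `min_billable_seconds` is an assumption you would set only if the pricing page lists a per-request minimum.

```python
import math

# Per-minute rates in USD, copied from the pricing table above.
RATES_PER_MINUTE = {
    ("standard", "batch"): 0.024,
    ("standard", "streaming"): 0.0288,
    ("medical", "batch"): 0.0480,
    ("medical", "streaming"): 0.0504,
    ("call_analytics", "batch"): 0.0480,
    ("call_analytics", "streaming"): 0.0480,
}
TOXICITY_ADDON_PER_MINUTE = 0.012


def job_cost(duration_seconds, tier="standard", mode="batch",
             toxicity=False, min_billable_seconds=0):
    """Estimate the charge for one job, billed per second and rounded up."""
    # Round partial seconds up, then apply any per-request minimum (assumed 0 here).
    billable_seconds = max(math.ceil(duration_seconds), min_billable_seconds)
    rate = RATES_PER_MINUTE[(tier, mode)]
    if toxicity:
        rate += TOXICITY_ADDON_PER_MINUTE
    return billable_seconds * rate / 60
```

A 90.2-second Standard batch clip is billed as 91 seconds, not as two full minutes, which is where per-second billing saves money on short files.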
Batch vs Real-Time Streaming
Batch transcription is the cheaper option. You upload audio to S3, start a transcription job via the API, and retrieve results once processing completes. Turnaround varies from near-instant for short clips to several minutes for longer files. Batch is best for post-processing workflows where latency is not critical — podcast transcription, meeting archives, media cataloging.
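The batch flow described above can be sketched with the boto3 Transcribe client, which exposes `start_transcription_job` and `get_transcription_job`. This is a minimal sketch, not a production implementation: the function name, the polling interval, and the bucket and job names in the usage comment are all assumptions, and the client is passed in so the logic can be exercised without AWS credentials.

```python
import time


def transcribe_batch(client, job_name, media_uri, language_code="en-US",
                     poll_seconds=5.0, max_polls=120):
    """Start a batch job, poll until it finishes, and return the transcript URI."""
    client.start_transcription_job(
        TranscriptionJobName=job_name,
        Media={"MediaFileUri": media_uri},
        LanguageCode=language_code,
    )
    for _ in range(max_polls):
        response = client.get_transcription_job(TranscriptionJobName=job_name)
        job = response["TranscriptionJob"]
        status = job["TranscriptionJobStatus"]
        if status == "COMPLETED":
            return job["Transcript"]["TranscriptFileUri"]
        if status == "FAILED":
            raise RuntimeError(job.get("FailureReason", "transcription failed"))
        time.sleep(poll_seconds)  # still queued or in progress
    raise TimeoutError(f"{job_name} did not finish after {max_polls} polls")


# Usage (requires boto3 and AWS credentials; names are placeholders):
#   import boto3
#   uri = transcribe_batch(boto3.client("transcribe"), "demo-job",
#                          "s3://example-bucket/audio/episode-01.mp3")
```

Polling is the simplest approach for a one-off script. For steady volume, reacting to job completion events avoids paying for idle waiting.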
Real-time streaming costs roughly 20 percent more for the Standard tier. You open a WebSocket connection and send audio chunks as they arrive, receiving partial transcripts within seconds. Streaming is essential for live captioning, real-time meeting assistants, and any application where users expect immediate output.
The streaming surcharge is worth factoring into your estimates. A workload that processes 1,000 hours per month at the Standard batch rate costs $1,440. The same volume via streaming costs $1,728, a $288 monthly difference that grows in direct proportion to volume.
Standard Tier Details
The Standard tier handles general-purpose transcription across 100+ languages and dialects. It includes automatic punctuation, speaker diarization (up to 10 speakers), custom vocabulary, vocabulary filtering, and automatic language identification.
At $0.024 per minute for batch processing, Standard sits in the mid-range of the cloud STT market. It is more expensive than Deepgram's base tier but cheaper than Google Cloud Speech-to-Text's premium models. Our Deepgram vs AWS Transcribe comparison breaks down the accuracy and feature differences in detail.
Medical Tier Details
AWS Transcribe Medical is purpose-built for healthcare. It understands medical terminology, drug names, anatomical references, and clinical shorthand that general-purpose models consistently get wrong. It is also HIPAA-eligible, meaning AWS will sign a Business Associate Agreement (BAA) covering transcription data.
At $0.0480 per minute, Medical costs exactly double the Standard tier. That premium buys you domain-specific accuracy that would otherwise require expensive custom vocabulary tuning, plus the compliance infrastructure that healthcare organizations need.
Medical supports two specialties: primary care and cardiology, with more being added over time. If your medical transcription needs fall outside those specialties, test accuracy carefully before committing.
Call Analytics Tier Details
Call Analytics is designed for contact centers. Beyond basic transcription, it provides turn-by-turn sentiment analysis, call categorization, issue detection, and call summarization. It integrates with Amazon Connect but also works with third-party telephony systems.
At $0.0480 per minute (same as Medical), Call Analytics bundles analytics features that would otherwise require separate NLP processing. For contact centers already paying for sentiment analysis and call categorization through other tools, the bundled pricing can represent meaningful savings.
Toxicity Detection Add-On
Toxicity Detection is an overlay that identifies toxic or harmful content in transcriptions. It adds $0.012 per minute on top of the base tier price. For a Standard tier job, that brings the effective rate to $0.036 per minute. For Medical or Call Analytics, it brings the rate to $0.060 per minute.
This add-on is most relevant for user-generated content platforms, moderation workflows, and compliance teams monitoring recorded conversations.

AWS Transcribe Free Tier
AWS offers a free tier for Transcribe that gives new accounts 60 minutes of transcription per month for the first 12 months. This applies to both batch and streaming in the Standard tier. Medical and Call Analytics have their own separate 60-minute free tier allowances.
What the Free Tier Actually Gets You
Sixty minutes per month is enough to evaluate accuracy, test API integration, and run a small proof of concept. It is not enough for any production workload. For context, a single one-hour meeting consumes your entire monthly allowance.
Key limitations to keep in mind:
- New AWS accounts only. If your organization already has an AWS account, the 12-month clock started when that account was created, not when you first use Transcribe.
- Per-account, not per-user. Creating additional accounts under the same organization does not extend the free tier, because AWS consolidates billing across linked accounts.
- Automatic billing after exhaustion. There is no hard cap. Once you exceed 60 minutes, AWS charges the standard per-minute rate with no warning unless you set up CloudWatch billing alerts.
- 12-month expiration is firm. After 12 months, you pay full price from minute one.
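To see how quickly the allowance runs out, the sketch below applies the free tier rules above to a month of Standard batch usage. The function name and the `account_age_months` parameter are hypothetical. The 60-minute allowance, the 12-month window, and the $0.024 rate come from this article.

```python
FREE_MINUTES_PER_MONTH = 60
FREE_TIER_MONTHS = 12
STANDARD_BATCH_RATE = 0.024  # USD per minute


def monthly_charge(minutes_used, account_age_months, rate=STANDARD_BATCH_RATE):
    """Return the month's charge after subtracting any free tier allowance."""
    # The allowance applies only during the first 12 months of the account.
    in_free_tier = account_age_months < FREE_TIER_MONTHS
    free_minutes = FREE_MINUTES_PER_MONTH if in_free_tier else 0
    billable_minutes = max(0, minutes_used - free_minutes)
    return billable_minutes * rate
```

Five hours of audio in month three of a new account is billed for 240 minutes. The same usage on a two-year-old account is billed for all 300.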
If you want to test transcription without worrying about billing, ConvertAudioToText offers a free tier with no expiration date and no credit card required.
Cost Calculations at Scale
The per-minute rate only tells part of the story. What matters is the monthly invoice at your actual usage volume. The tables below project costs across three tiers for common workload sizes.
Standard Tier Monthly Costs
| Monthly Volume | Batch Cost | Streaming Cost | Cost per Hour (Batch) |
|---|---|---|---|
| 100 hours | $144 | $172.80 | $1.44 |
| 500 hours | $720 | $864 | $1.44 |
| 1,000 hours | $1,440 | $1,728 | $1.44 |
| 5,000 hours | $7,200 | $8,640 | $1.44 |
| 10,000 hours | $14,400 | $17,280 | $1.44 |
Medical Tier Monthly Costs
| Monthly Volume | Batch Cost | Streaming Cost | Cost per Hour (Batch) |
|---|---|---|---|
| 100 hours | $288 | $302.40 | $2.88 |
| 500 hours | $1,440 | $1,512 | $2.88 |
| 1,000 hours | $2,880 | $3,024 | $2.88 |
| 5,000 hours | $14,400 | $15,120 | $2.88 |
| 10,000 hours | $28,800 | $30,240 | $2.88 |
Call Analytics Monthly Costs
| Monthly Volume | Batch Cost | Streaming Cost | Cost per Hour (Batch) |
|---|---|---|---|
| 100 hours | $288 | $288 | $2.88 |
| 500 hours | $1,440 | $1,440 | $2.88 |
| 1,000 hours | $2,880 | $2,880 | $2.88 |
| 5,000 hours | $14,400 | $14,400 | $2.88 |
| 10,000 hours | $28,800 | $28,800 | $2.88 |
A few patterns stand out. First, AWS Transcribe offers no volume discounts. Whether you process 100 hours or 10,000 hours, the per-minute rate stays flat. This is unusual — most enterprise cloud services offer tiered pricing that rewards scale. Deepgram, Google, and AssemblyAI all provide committed-use discounts at higher volumes.
Second, the Medical and Call Analytics tiers cost exactly double the Standard tier for batch processing. If you are running a mixed workload where only some audio requires medical-grade transcription, routing general audio through the Standard tier and clinical audio through Medical can cut costs significantly.
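Third, the streaming surcharge varies by tier. It adds 20 percent to Standard costs and 5 percent to Medical, but nothing to Call Analytics, where batch and streaming are priced identically. If your primary use case is contact center analytics, real-time processing costs no more than post-call processing.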
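The monthly figures in the tables above follow directly from the per-minute rates, so you can reproduce them for your own volumes. The sketch below is one way to do that. The function names are hypothetical and the rates are the ones listed in this article.

```python
# (batch, streaming) per-minute rates in USD, from the tier table above.
RATES = {
    "Standard": (0.024, 0.0288),
    "Medical": (0.0480, 0.0504),
    "Call Analytics": (0.0480, 0.0480),
}


def monthly_cost(hours, rate_per_minute):
    """Flat-rate monthly cost in USD for a given number of audio hours."""
    return round(hours * 60 * rate_per_minute, 2)


def streaming_premium_percent(batch_rate, streaming_rate):
    """Percentage by which the streaming rate exceeds the batch rate."""
    return round((streaming_rate / batch_rate - 1) * 100, 1)


if __name__ == "__main__":
    for tier, (batch, streaming) in RATES.items():
        premium = streaming_premium_percent(batch, streaming)
        print(f"{tier}: streaming premium {premium}%")
        for hours in (100, 500, 1_000, 5_000, 10_000):
            print(f"  {hours:>6} h  batch ${monthly_cost(hours, batch):>10,.2f}"
                  f"  streaming ${monthly_cost(hours, streaming):>10,.2f}")
```

Because the rate is flat, cost per hour is the same on every row. Swapping in your own volume list is enough to build a budget line.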
For a broader comparison of how these numbers stack up against other providers, see our best speech-to-text APIs guide.
The AWS Ecosystem Advantage
Raw per-minute pricing does not capture the full economics of choosing AWS Transcribe. If your infrastructure already runs on AWS, Transcribe plugs into your existing stack with zero data transfer costs and native integrations that reduce engineering time.
S3 Integration
Batch transcription reads directly from S3 buckets and writes results back to S3. There are no data transfer charges for moving audio between S3 and Transcribe within the same region. For organizations already storing audio in S3 — recorded meetings, call center archives, podcast hosting — this eliminates the egress costs that add up when sending data to external transcription APIs.
A typical external API workflow looks like this: download from S3, upload to the transcription provider, download the result. At S3's $0.09/GB egress rate, sending 1 TB of audio out of AWS costs $90 in transfer fees alone. With Transcribe, that cost is zero.
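The transfer arithmetic is simple enough to fold into a cost model. This sketch assumes decimal units (1 TB = 1,000 GB) and the $0.09/GB rate quoted above. The function name is hypothetical.

```python
S3_EGRESS_PER_GB = 0.09  # USD, the egress rate quoted above


def egress_cost(terabytes, rate_per_gb=S3_EGRESS_PER_GB):
    """Transfer-out cost in USD, using decimal units (1 TB = 1,000 GB)."""
    return terabytes * 1_000 * rate_per_gb
```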
Lambda and EventBridge Automation
AWS Lambda functions can trigger transcription jobs automatically when new audio files land in S3. EventBridge rules can route Transcribe job completion events to downstream services — storing results in DynamoDB, sending notifications via SNS, or kicking off additional processing.
This event-driven architecture is particularly powerful for high-volume workflows. A media company that receives hundreds of audio files daily can fully automate the transcription pipeline without building or maintaining queue infrastructure. The Lambda invocation cost for triggering a Transcribe job is fractions of a cent.
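A minimal sketch of the S3-to-Lambda trigger might look like the following. It assumes the standard S3 event notification shape (`Records[].s3.bucket.name` and `Records[].s3.object.key`) and the boto3 `start_transcription_job` call. The helper name, the job naming scheme, and the default language code are illustrative assumptions.

```python
import re
import uuid
from urllib.parse import unquote_plus


def start_jobs_for_event(event, transcribe_client, language_code="en-US"):
    """Start one transcription job per object in an S3 event notification."""
    job_names = []
    for record in event.get("Records", []):
        bucket = record["s3"]["bucket"]["name"]
        # Object keys arrive URL-encoded in S3 event notifications.
        key = unquote_plus(record["s3"]["object"]["key"])
        # Job names allow only letters, digits, '.', '_' and '-', and must be
        # unique, so sanitize the key and append a short random suffix.
        safe_key = re.sub(r"[^0-9A-Za-z._-]", "_", key)[:150]
        job_name = f"{safe_key}-{uuid.uuid4().hex[:8]}"
        transcribe_client.start_transcription_job(
            TranscriptionJobName=job_name,
            Media={"MediaFileUri": f"s3://{bucket}/{key}"},
            LanguageCode=language_code,
        )
        job_names.append(job_name)
    return job_names


def lambda_handler(event, context):
    import boto3  # bundled with the AWS Lambda Python runtime
    return start_jobs_for_event(event, boto3.client("transcribe"))
```

Keeping the client as a parameter makes the handler logic testable locally with a stub, while `lambda_handler` stays a thin wrapper.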
IAM and Security
Transcribe inherits AWS's IAM permission model. You can scope Transcribe access to specific roles, enforce encryption at rest and in transit, audit all API calls through CloudTrail, and satisfy compliance requirements using the same tooling your security team already manages.
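As one example of scoping access, the sketch below builds a least-privilege policy document for a role that only runs batch jobs. The bucket names are placeholders and the function name is hypothetical. The action names are standard IAM actions for Transcribe and S3, but treat the overall shape as a starting point to review with your security team rather than a recommended policy.

```python
import json


def transcribe_job_policy(input_bucket, output_bucket):
    """Build an IAM policy document for starting and checking batch jobs."""
    return {
        "Version": "2012-10-17",
        "Statement": [
            {
                "Sid": "RunTranscriptionJobs",
                "Effect": "Allow",
                "Action": [
                    "transcribe:StartTranscriptionJob",
                    "transcribe:GetTranscriptionJob",
                ],
                "Resource": "*",
            },
            {
                "Sid": "ReadSourceAudio",
                "Effect": "Allow",
                "Action": ["s3:GetObject"],
                "Resource": f"arn:aws:s3:::{input_bucket}/*",
            },
            {
                "Sid": "WriteTranscripts",
                "Effect": "Allow",
                "Action": ["s3:PutObject"],
                "Resource": f"arn:aws:s3:::{output_bucket}/*",
            },
        ],
    }


if __name__ == "__main__":
    # Placeholder bucket names for illustration only.
    print(json.dumps(transcribe_job_policy("example-audio", "example-transcripts"), indent=2))
```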
For regulated industries, this matters. Building equivalent security controls around an external transcription API requires significant engineering effort — API key rotation, request signing, audit logging, and network isolation all need to be implemented from scratch.
CloudWatch Monitoring
Transcribe publishes metrics to CloudWatch: job counts, processing times, error rates, and billing estimates. You can set alarms for cost thresholds, build dashboards showing transcription throughput, and integrate with existing operational monitoring without adopting a new observability tool.
When the Ecosystem Saves Real Money
The AWS ecosystem advantage is most significant when three conditions align:
- Audio is already in S3. You avoid egress costs that other providers require.
- You need automation around transcription. Lambda, EventBridge, and Step Functions reduce engineering time.
- Compliance requirements are strict. IAM, CloudTrail, and AWS's compliance certifications reduce security engineering overhead.
If all three apply, the effective cost of AWS Transcribe is lower than the per-minute rate suggests, because you are saving on infrastructure, engineering, and compliance costs that would otherwise appear as separate line items.

When AWS Transcribe Is Not the Best Choice
AWS Transcribe is a solid service, but it is not the optimal choice for every team and every workload. Several scenarios favor alternative providers.
Price-Sensitive Startups
At $0.024 per minute with no volume discounts, AWS Transcribe is not the cheapest option available. Deepgram's pay-as-you-go rate starts lower, and several providers offer aggressive committed-use pricing for startups processing significant volume. If transcription cost is a primary concern and you are processing more than a few hundred hours per month, shopping around can save 30 to 50 percent.
Our Deepgram vs AWS Transcribe analysis quantifies the price difference across different volume tiers.
Teams Not Already on AWS
The ecosystem advantages described above only apply if your infrastructure is already on AWS. If you are running on GCP, Azure, or a mix of providers, using Transcribe means adding AWS to your stack — additional billing, IAM configuration, networking, and operational overhead. The transcription itself might be competitively priced, but the total cost of operating another cloud provider is not.
In multi-cloud or non-AWS environments, provider-agnostic APIs like Deepgram, AssemblyAI, or ConvertAudioToText offer simpler integration without vendor lock-in.
Workloads Requiring Maximum Accuracy
AWS Transcribe's accuracy is competitive but not class-leading for all languages and audio conditions. If your workload involves heavy accents, significant background noise, domain-specific jargon outside the Medical tier's specialties, or languages where AWS has limited training data, you may get better results from providers that specialize in those areas.
Always benchmark with your own audio before committing. Accuracy differences of even a few percentage points can significantly impact downstream workflows that depend on transcript quality.
Simple, Low-Volume Use Cases
If you need to transcribe a handful of files per week and do not need programmatic API access, a managed transcription platform is simpler and often cheaper than maintaining AWS infrastructure. ConvertAudioToText handles file uploads, URL-based transcription, and multiple export formats through a web interface with no AWS account, no S3 buckets, and no IAM policies required.
Real-Time Applications Needing Sub-Second Latency
AWS Transcribe's streaming mode delivers results with reasonable latency for most applications, but it is not the fastest option available. If your product requires the absolute lowest latency — live captioning for broadcasts, real-time translation pipelines, or interactive voice applications — specialized providers may deliver faster partial results.
Hidden Costs to Watch For
The per-minute rate is the largest cost component, but several adjacent charges can inflate your AWS Transcribe bill.
S3 Storage Costs
Audio files stored in S3 incur standard S3 storage charges ($0.023/GB/month for S3 Standard). If you are archiving large volumes of audio, storage costs add up. A thousand hours of MP3 audio at 128 kbps is roughly 58 GB (128 kbps is 16 kB per second, or 57.6 MB per hour), costing about $1.32 per month in storage. Not significant at small scale, but it scales linearly.
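The storage estimate can be derived from bitrate and duration. The sketch below assumes constant-bitrate audio and decimal gigabytes. The function names are hypothetical and the $0.023/GB price is the S3 Standard rate quoted above. At 128 kbps, a thousand hours comes out near 58 GB.

```python
S3_STANDARD_PER_GB_MONTH = 0.023  # USD, the rate quoted above


def audio_size_gb(hours, bitrate_kbps=128):
    """Approximate size of constant-bitrate audio in decimal gigabytes."""
    total_bits = hours * 3600 * bitrate_kbps * 1000
    return total_bits / 8 / 1e9


def monthly_storage_cost(hours, bitrate_kbps=128,
                         price_per_gb=S3_STANDARD_PER_GB_MONTH):
    """Monthly S3 storage cost in USD for the given amount of audio."""
    return audio_size_gb(hours, bitrate_kbps) * price_per_gb
```

Uncompressed WAV at CD quality runs at roughly 1,411 kbps, so passing that bitrate shows how much more the same archive costs if you skip compression.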
Custom Vocabulary and Language Model Costs
Creating and maintaining custom vocabularies is free, but the engineering time to build, test, and update them is not. If your domain requires extensive custom vocabulary work to reach acceptable accuracy, factor that labor cost into your total cost of ownership.
Data Transfer Between Regions
S3-to-Transcribe within the same region is free. Cross-region data transfer is not. If your audio is in us-east-1 but you run Transcribe in eu-west-1 for data residency reasons, inter-region transfer charges apply at $0.02/GB.
CloudWatch Logs and Metrics
Transcribe outputs to CloudWatch, which charges for log ingestion ($0.50/GB), log storage ($0.03/GB/month), and custom metrics. High-volume workloads can generate meaningful CloudWatch costs if logging is not configured carefully.
Comparing AWS Transcribe to Alternatives
To put AWS Transcribe pricing in context, here is how it compares to other major STT providers at the Standard tier level.
| Provider | Batch Rate (per min) | Streaming Rate (per min) | Volume Discounts | Free Tier |
|---|---|---|---|---|
| AWS Transcribe | $0.024 | $0.0288 | No | 60 min/mo (12 months) |
| Google Cloud STT | $0.016 - $0.048 | $0.022 - $0.060 | Yes (tiered) | 60 min/mo |
| Deepgram | $0.0145 | $0.0160 | Yes (committed use) | $200 credit |
| AssemblyAI | $0.015 | $0.020 | Yes (enterprise) | 100 hours |
| Azure Speech | $0.016 | $0.020 | Yes (committed) | 5 hours/mo |
AWS Transcribe sits in the middle of the pack on raw per-minute cost. Its competitive advantage is ecosystem integration, not price leadership. For a deep dive into all of these providers, see our full pricing comparison.
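To turn the table into a monthly figure at your own volume, a short ranking helper is enough. This sketch uses the batch list rates from the table above and ignores volume discounts and free credits. Google Cloud STT is left out because the table gives a range rather than a single rate. The function name is hypothetical.

```python
# Batch list rates in USD per minute, copied from the comparison table above.
BATCH_RATES = {
    "AWS Transcribe": 0.024,
    "Deepgram": 0.0145,
    "AssemblyAI": 0.015,
    "Azure Speech": 0.016,
}


def rank_by_monthly_cost(hours, rates=BATCH_RATES):
    """Return (provider, monthly cost in USD) pairs, cheapest first."""
    minutes = hours * 60
    costs = {name: round(minutes * rate, 2) for name, rate in rates.items()}
    return sorted(costs.items(), key=lambda item: item[1])
```

List price is only one input. Egress, integration effort, and accuracy on your own audio can change the ranking.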
Optimizing Your AWS Transcribe Costs
If you have decided that AWS Transcribe is the right fit, several strategies can reduce your monthly bill.
Use Batch When Possible
Streaming costs 20 percent more than batch on the Standard tier and 5 percent more on Medical. If your workflow can tolerate batch turnaround, which ranges from seconds for short clips to several minutes for long files, batch is the cheaper option on both tiers. Many applications that seem to require real-time transcription can actually use near-real-time batch processing with short audio segments.
Route Audio to the Right Tier
Do not send general-purpose audio through the Medical or Call Analytics tiers. If your workload mixes clinical and non-clinical audio, implement routing logic that sends each file to the appropriate tier.
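Routing logic can be as simple as a lookup on file metadata. The sketch below is hypothetical throughout: the `source` and `contains_phi` fields stand in for whatever tags your pipeline attaches to each file, and the rates are the batch rates from this article.

```python
# Batch rates in USD per minute, from the tier table in this article.
BATCH_RATE = {"standard": 0.024, "medical": 0.0480, "call_analytics": 0.0480}


def choose_tier(metadata):
    """Pick the cheapest tier that meets the file's needs (hypothetical tags)."""
    if metadata.get("contains_phi") or metadata.get("source") == "clinical":
        return "medical"
    if metadata.get("source") == "contact_center":
        return "call_analytics"
    return "standard"


def routed_cost(files):
    """Total cost in USD for an iterable of (minutes, metadata) pairs."""
    return round(sum(minutes * BATCH_RATE[choose_tier(meta)]
                     for minutes, meta in files), 2)
```

For a 1,000-hour month where 20 percent of the audio is clinical, routing costs $1,728 instead of the $2,880 you would pay by sending everything through Medical.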
Compress Audio Before Transcription
Transcribe supports common audio formats including MP3, FLAC, WAV, and OGG. Sending compressed formats (MP3 at 128 kbps instead of uncompressed WAV) reduces S3 storage costs and upload times, usually with little effect on transcription quality, though it is worth confirming on a sample of your own audio. Transcribe bills by audio duration, not file size, so the savings come entirely from storage and transfer.
Set Up Billing Alerts
Configure CloudWatch billing alerts to catch unexpected usage spikes before they become expensive surprises. Set alerts at 50 percent, 80 percent, and 100 percent of your expected monthly spend.
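One way to set the three thresholds is to generate the alarm requests in code. The sketch below builds arguments for the CloudWatch `put_metric_alarm` call against the `AWS/Billing` `EstimatedCharges` metric. It assumes billing alerts are enabled on the account and that an SNS topic already exists. The alarm names and the topic ARN in the usage comment are placeholders. As written, the alarms track total account charges rather than Transcribe alone.

```python
def billing_alarm_requests(expected_monthly_usd, sns_topic_arn,
                           percents=(50, 80, 100)):
    """Build one put_metric_alarm request per threshold percentage."""
    requests = []
    for pct in percents:
        requests.append({
            "AlarmName": f"monthly-spend-{pct}pct",  # hypothetical naming scheme
            "Namespace": "AWS/Billing",
            "MetricName": "EstimatedCharges",
            "Dimensions": [{"Name": "Currency", "Value": "USD"}],
            "Statistic": "Maximum",
            "Period": 21600,  # six hours; billing data updates infrequently
            "EvaluationPeriods": 1,
            "Threshold": round(expected_monthly_usd * pct / 100, 2),
            "ComparisonOperator": "GreaterThanThreshold",
            "AlarmActions": [sns_topic_arn],
        })
    return requests


# Usage (requires boto3; billing metrics are published in us-east-1):
#   import boto3
#   cloudwatch = boto3.client("cloudwatch", region_name="us-east-1")
#   topic = "arn:aws:sns:us-east-1:123456789012:billing-alerts"  # placeholder
#   for request in billing_alarm_requests(1440, topic):
#       cloudwatch.put_metric_alarm(**request)
```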
Clean Up S3 After Processing
If you do not need to retain source audio after transcription, set S3 lifecycle rules to delete or archive files to S3 Glacier after processing completes. This prevents storage costs from accumulating indefinitely.
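A lifecycle rule of this kind can be expressed as the configuration document accepted by the S3 `put_bucket_lifecycle_configuration` call. Note that lifecycle rules act on object age, not on whether a transcription job has finished, so the day counts below are an assumed proxy for "after processing". The prefix, rule ID, and bucket name are placeholders.

```python
def audio_lifecycle_config(prefix, archive_after_days=30, delete_after_days=None,
                           rule_id="archive-source-audio"):
    """Build an S3 lifecycle configuration that archives, then optionally deletes."""
    rule = {
        "ID": rule_id,
        "Filter": {"Prefix": prefix},
        "Status": "Enabled",
        # Move objects under the prefix to Glacier once they reach this age.
        "Transitions": [{"Days": archive_after_days, "StorageClass": "GLACIER"}],
    }
    if delete_after_days is not None:
        rule["Expiration"] = {"Days": delete_after_days}
    return {"Rules": [rule]}


# Usage (requires boto3; bucket name is a placeholder):
#   import boto3
#   boto3.client("s3").put_bucket_lifecycle_configuration(
#       Bucket="example-audio",
#       LifecycleConfiguration=audio_lifecycle_config("raw-audio/", 7, 90),
#   )
```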
Frequently Asked Questions
How much does AWS Transcribe cost per minute?
AWS Transcribe Standard tier costs $0.024 per minute for batch transcription and $0.0288 per minute for real-time streaming. The Medical tier and Call Analytics tier both cost $0.0480 per minute for batch processing. Toxicity Detection adds $0.012 per minute on top of any tier. All billing is calculated in per-second increments. You can verify current pricing on the official AWS Transcribe pricing page.
Is there a free tier for AWS Transcribe?
Yes. New AWS accounts receive 60 minutes of free transcription per month for the first 12 months. This applies separately to the Standard, Medical, and Call Analytics tiers. After the 12-month period expires or after you exceed 60 minutes in a given month, standard per-minute rates apply automatically. There is no free tier for existing accounts beyond the initial 12-month window.
Does AWS Transcribe offer volume discounts?
No. Unlike most competing services, AWS Transcribe charges the same per-minute rate regardless of volume. Whether you transcribe 10 minutes or 10,000 hours per month, the rate remains $0.024 per minute for the Standard tier. If volume discounts are important to your cost model, providers like Deepgram, Google Cloud, and Azure Speech all offer committed-use or tiered pricing that decreases at higher volumes.
How does AWS Transcribe pricing compare to Google Cloud Speech-to-Text?
Google Cloud Speech-to-Text uses tiered pricing that starts at $0.016 per minute for standard models and increases for premium models with enhanced accuracy. At low volumes, Google's standard model is cheaper than AWS Transcribe. At high volumes, Google offers additional discounts. AWS Transcribe's advantage is tighter integration with other AWS services, particularly S3, Lambda, and IAM, which can reduce total infrastructure costs for teams already on AWS. See AWS Transcribe features for a full feature comparison.
Can I use AWS Transcribe for HIPAA-compliant medical transcription?
Yes. The AWS Transcribe Medical tier is HIPAA-eligible, and AWS will sign a Business Associate Agreement (BAA) covering Transcribe Medical data. The Medical tier costs $0.0480 per minute and is specifically trained on medical terminology for primary care and cardiology specialties. Before sending protected health information through any other tier, confirm that it appears on AWS's current list of HIPAA-eligible services, and configure your AWS account according to AWS HIPAA compliance guidelines.