Category
Speech-to-text, meeting transcription, real-time captions, audio-to-text APIs, and voice intelligence platforms.
Amazon Web Services
AWS's managed STT service, with deep AWS ecosystem integration, HIPAA eligibility, call analytics, and a medical model.
AssemblyAI
Speech AI platform for transcription and audio intelligence.
Deepgram
Enterprise speech-to-text and voice AI platform.
Fathom Video
The best free AI notetaker — unlimited Zoom, Meet, and Teams recording with zero meeting length caps.
Fireflies.ai
AI meeting assistant for recording, transcribing, summarizing, and analyzing conversations.
Gladia
#1 async STT accuracy in 2026 — Solaria-1 with 29% lower WER, 100+ languages, EU data residency.
Google
Google's enterprise STT API — Chirp 3 HD with 125+ languages, streaming and batch, GCP ecosystem.
Grain
Meeting intelligence for coaching and sales enablement — clip moments, create highlight reels, CRM sync.
MeetGeek
Meeting intelligence with structured AI summaries, 7,000+ integrations, and the best analytics per dollar.
NoteGPT
AI learning assistant for summarizing YouTube videos, PDFs, and documents into notes and mind maps.
OpenAI
The most accurate batch STT API — GPT-4o Transcribe at 8.9% WER, open-source Whisper for self-hosting.
Otter.ai
AI meeting assistant with transcription and note-taking.
Rev
Speech-to-text API for transcription and captioning.
VEED (acquired)
Studio-quality podcast and video recording with local track capture, AI transcription, and clip export.
Sonix
File-upload transcription for media, research, and enterprise — 53+ languages, SOC 2, and HIPAA.
Speechmatics
Multilingual speech recognition and transcription platform.
SuperUltra, Inc.
Lightning-fast AI voice dictation and transcription application for macOS and iOS.
tl;dv
Meeting recorder with AI clips, multi-meeting search, and CRM integrations — strong free tier.
Wispr
Enterprise-grade AI voice keyboard and cross-platform dictation tool built for speed.