Speech-to-Text Projects

Technology

Speech-to-Text

Speech-to-Text (STT) instantly converts spoken audio into written text: it’s the core engine for voice assistants like Alexa and real-time captioning across 125+ languages.

Speech-to-Text, formally Automatic Speech Recognition (ASR), leverages sophisticated deep learning models to transform human speech into a digital text format. This technology powers critical enterprise applications: transcribing contact center calls, generating subtitles for live media, and enabling voice commands for smart devices. Major providers, including Google Cloud and Amazon Transcribe, offer APIs with high accuracy (often 95%+) and features like speaker diarization and custom vocabulary, making voice data actionable across nearly every industry.

https://cloud.google.com/speech-to-text

9 projects · 8 cities

Related technologies

Text-to-Speech 16 RAG 138 BERT 179 GPT-4 528 RoBERTa 118 3D AI 1 BLOOM 115 Claw 1 Conversational AI 5 Deepgram 11 faster-whisper 3 Fine-tuning 20 Generative AI 45 GitHub API 3 GitHub API – repository activity 1 GPT-3 191 GPT-4o 56 Llama-2 227

Recent Talks & Demos

Showing 1-9 of 9

Members-Only

Sign in to see who built these projects

Sign in View FAQ

GitHub API Claw

VOZY: Voice AI Sales Agent

Medellín Apr 1

Speech-to-Text Text-to-Speech

Conversational RAG chatbot

Python Speech-to-Text

Beyond Presence: Hyper-Realistic Avatars

Voice-to-Voice AI Building Blocks

San Francisco Aug 21

Deepgram Voice AI Platform

Deepgram Speech-to-Text

LuminaLog: AI Journaling Companion

Palo Alto Jun 11

Speech-to-Image Fun

faster-whisper Stable Diffusion

1B+ Speech-Text LLM Training

Speech-to-Text Fine-tuning