OpenAI Whisper Projects

OpenAI Whisper

Whisper is OpenAI's robust, open-source Automatic Speech Recognition (ASR) system, trained on 680,000 hours of diverse audio.

Whisper is a general-purpose ASR model from OpenAI. It was trained on 680,000 hours of multilingual, multitask data, which makes it robust to accents, background noise, and technical language. The model uses a Transformer sequence-to-sequence (encoder-decoder) architecture and handles several tasks: multilingual transcription, speech-to-English translation, and language identification. The code is open source and the model ships in several sizes (tiny, base, small, medium, large), so developers can trade transcription speed against accuracy, which can approach human level.
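As a rough illustration of the size and task choices described above, here is a minimal sketch using the open-source `whisper` Python package. It assumes the package is installed (`pip install openai-whisper`) and that `ffmpeg` is available on the system. The helper names `decode_options` and `run_whisper` are hypothetical and exist only for this example; `whisper.load_model` and `model.transcribe` are the package's own calls.

```python
# Sizes as listed in the description above; the package may offer other variants.
MODEL_SIZES = ("tiny", "base", "small", "medium", "large")


def decode_options(translate_to_english: bool = False) -> dict:
    """Hypothetical helper: build keyword arguments for model.transcribe().

    "transcribe" keeps the spoken language; "translate" outputs English text.
    """
    return {"task": "translate" if translate_to_english else "transcribe"}


def run_whisper(audio_path: str, size: str = "base",
                translate_to_english: bool = False) -> dict:
    """Hypothetical helper: load a model of the given size and run it on one file."""
    if size not in MODEL_SIZES:
        raise ValueError(f"unknown model size {size!r}; expected one of {MODEL_SIZES}")

    # Imported lazily so the helpers above can be used without the package installed.
    import whisper

    model = whisper.load_model(size)  # downloads the weights on first use
    result = model.transcribe(audio_path, **decode_options(translate_to_english))

    # The result also carries timestamped "segments"; only two fields are kept here.
    return {"text": result["text"], "language": result["language"]}


if __name__ == "__main__":
    # "meeting.mp3" is a placeholder path, not a file referenced by this page.
    print(run_whisper("meeting.mp3", size="small"))
```

Smaller sizes such as `tiny` and `base` generally run faster and use less memory, while `medium` and `large` tend to be more accurate, so the `size` argument is the main knob for the speed and accuracy trade-off mentioned above.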

https://github.com/openai/whisper

10 projects · 9 cities

Related technologies

Llama 3 (38), PyTorch (265), React (194), Amazon Polly (1), Amazon Rekognition (1), Amazon S3 (7), Amazon Transcribe (2), Azure AI Service (1), Azure Speech Service (2), Azure Speech to Text (1), BERT (179), Claude (145), Claude-3 (110), CMU Sphinx (2), Coqui TTS (1), Demucs (1), Docker (128), eSpeak (1)

Recent Talks & Demos

Reachy Mini: OpenClaw Brain

New York City Feb 17

OpenClaw Honcho

Public Speaking AI Agent

Tiruchirappalli Jan 31

React TypeScript

Xing Xing: Efficient AI Music

Zettaware: Language Model Motion Planning

Claude-3 Sonnet

Rekognition Twelve Labs NASCAR Indexing

Twelve Labs Amazon S3

AI Speech Input for Windows Apps

OpenAI Whisper Azure Speech Service

Aura: Local AI Gaming Companion

Llama 3 OpenAI Whisper

Llama 3 OpenAI Whisper

AI Call Analyst

Medellín Jan 29

OpenAI Whisper OpenAI API

Tashkent Oct 31

Google Cloud Speech-to-Text Google Cloud Text-to-Speech