.

Technology

OpenAI Whisper

Whisper is OpenAI's robust, open-source Automatic Speech Recognition (ASR) system, trained on 680,000 hours of diverse audio.

This is Whisper: a high-performance, general-purpose ASR model from OpenAI. It was trained on a massive 680,000 hours of multilingual, multitask data, resulting in exceptional robustness against accents, background noise, and technical language. The model is a Transformer sequence-to-sequence architecture, engineered for multiple tasks: multilingual transcription, speech-to-English translation, and language identification. Developers leverage the open-source code and various model sizes (tiny, base, small, medium, large) to balance transcription speed with near human-level accuracy for diverse applications.

https://github.com/openai/whisper
14 projects · 13 cities

Related technologies

Recent Talks & Demos

Showing 1-14 of 14

Members-Only

Sign in to see who built these projects