Technology
OpenAI Whisper
Whisper is OpenAI's robust, open-source Automatic Speech Recognition (ASR) system, trained on 680,000 hours of diverse audio.
This is Whisper: a high-performance, general-purpose ASR model from OpenAI. It was trained on a massive 680,000 hours of multilingual, multitask data, resulting in exceptional robustness against accents, background noise, and technical language. The model is a Transformer sequence-to-sequence architecture, engineered for multiple tasks: multilingual transcription, speech-to-English translation, and language identification. Developers leverage the open-source code and various model sizes (tiny, base, small, medium, large) to balance transcription speed with near human-level accuracy for diverse applications.
Related technologies
Recent Talks & Demos
Showing 1-14 of 14