Technology

IBM Watson Speech to Text

IBM Watson Speech to Text converts audio into accurate text using deep learning AI for real-time transcription and voice-driven applications.

Watson Speech to Text leverages sophisticated neural network models to transcribe audio in 20+ languages (including dialects like Brazilian Portuguese and Modern Standard Arabic). It handles low-quality telephony audio at 8kHz and high-fidelity 16kHz streams with equal precision. Developers use its robust API to automate contact center analytics, generate live captions, and enable voice commands. Key features include speaker diarization (identifying who said what), smart formatting for dates and currencies, and custom acoustic models that adapt to specific industry jargon or noisy environments.

https://www.ibm.com/products/speech-to-text

2 projects · 2 cities

Related technologies

Amazon Transcribe 2 BERT 179 CMU Sphinx 2 Google Cloud Speech-to-Text 2 GPT-3 191 GPT-4 528 Kaldi 2 Amazon Polly 1 Azure Speech to Text 1 BLOOM 115 Database 8 DeepSpeech 1 eSpeak 1 Festival 1 Google Cloud Text-to-Speech 1 IBM Watson Text to Speech 1 Keras 74 Llama-2 227

Recent Talks & Demos

Showing 1-2 of 2

Members-Only

UzbekVoice

Tashkent Oct 31

Google Cloud Speech-to-Text Google Cloud Text-to-Speech

YT shorts finder

Austin Sep 12

Vision models Whisper