Technology

Web Speech API

Integrate voice input and output into web applications using the `SpeechRecognition` (Speech-to-Text) and `SpeechSynthesis` (Text-to-Speech) interfaces.

The Web Speech API delivers robust voice functionality via two core interfaces: `SpeechRecognition` and `SpeechSynthesis`. Use `SpeechRecognition` to process live audio input (from a microphone) into a text string, enabling voice commands or dictation. The recognition service can operate server-side or on-device, offering flexibility for privacy and performance. Conversely, the `SpeechSynthesis` interface handles Text-to-Speech (TTS): it queues `SpeechSynthesisUtterance` objects for speaking, allowing developers to control critical parameters (voice selection, pitch, volume). This dual capability is critical for building accessible interfaces and hands-free application control.

https://developer.mozilla.org/en-US/docs/Web/API/Web_Speech_API

2 projects · 2 cities

Related technologies

Amazon Translate 1 Claude API 17 Desktop Application 1 Fairseq 1 FastAPI 160 Google Cloud Translation API 1 Marian NMT 1 memory management 2 Microsoft Translator 1 OpenNMT 1 Speech Translation 1 Transformer 11 Video Conferencing 1 Voice Streaming 1

Recent Talks & Demos

Showing 1-2 of 2

Members-Only

Speech Translation Google Cloud Translation API