Technology
Web Speech API
Integrate voice input and output into web applications using the `SpeechRecognition` (Speech-to-Text) and `SpeechSynthesis` (Text-to-Speech) interfaces.
The Web Speech API delivers robust voice functionality via two core interfaces: `SpeechRecognition` and `SpeechSynthesis`. Use `SpeechRecognition` to process live audio input (from a microphone) into a text string, enabling voice commands or dictation. The recognition service can operate server-side or on-device, offering flexibility for privacy and performance. Conversely, the `SpeechSynthesis` interface handles Text-to-Speech (TTS): it queues `SpeechSynthesisUtterance` objects for speaking, allowing developers to control critical parameters (voice selection, pitch, volume). This dual capability is critical for building accessible interfaces and hands-free application control.
Related technologies
Recent Talks & Demos
Showing 1-2 of 2