Speech synthesis

Speech synthesis (Text-to-Speech or TTS) is the computational process of artificially producing human speech from written text.

Speech synthesis is the core technology behind Text-to-Speech (TTS) systems: it converts linguistic input into audible, human-like speech. Modern systems use deep neural networks (DNNs), such as Google DeepMind's WaveNet, and other generative models in place of older concatenative methods, which stitched together pre-recorded speech fragments. The neural approach improves both naturalness and intelligibility. A typical pipeline has a front-end (text analysis and prosody generation) and a back-end (acoustic modeling, followed by a vocoder that generates the waveform). The technology underpins screen readers for accessibility, virtual assistants such as Amazon Alexa, and automated content narration at scale.
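
The front-end/back-end split described above can be illustrated with a toy sketch. This is not how WaveNet or any production system works: the prosody rules are hand-written stand-ins for a learned model, the "vocoder" emits a plain sine tone per word, and every name here (`Unit`, `front_end`, `back_end`, `synthesize`, `SAMPLE_RATE`) is hypothetical and chosen only for illustration.

```python
import math
import re
from dataclasses import dataclass

SAMPLE_RATE = 16_000  # samples per second (assumed value for this toy example)


@dataclass
class Unit:
    """One word plus the prosody targets the front-end assigned to it."""
    text: str
    pitch_hz: float
    duration_s: float


def front_end(text: str) -> list[Unit]:
    """Text analysis and prosody generation, using toy rules instead of a model."""
    is_question = text.strip().endswith("?")
    # Text analysis: lowercase the input and split it into word tokens.
    words = re.findall(r"[a-z']+", text.lower())
    units = []
    for i, word in enumerate(words):
        pitch = 120.0
        # Toy prosody rule: raise the pitch on the final word of a question.
        if is_question and i == len(words) - 1:
            pitch = 160.0
        # Toy duration rule: longer words take longer to say.
        units.append(Unit(word, pitch, 0.05 * len(word)))
    return units


def back_end(units: list[Unit]) -> list[float]:
    """Stand-in for acoustic model plus vocoder: one sine tone per unit."""
    samples = []
    for unit in units:
        n = round(unit.duration_s * SAMPLE_RATE)
        for t in range(n):
            samples.append(math.sin(2 * math.pi * unit.pitch_hz * t / SAMPLE_RATE))
    return samples


def synthesize(text: str) -> list[float]:
    """Run the full pipeline: text -> prosody-annotated units -> waveform."""
    return back_end(front_end(text))
```

Calling `synthesize("Is it ready?")` returns a list of floating-point samples in the range -1 to 1. In a real system the front-end would also handle number and abbreviation expansion and phoneme conversion, and the back-end would predict spectral features before a neural vocoder renders audio.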

https://en.wikipedia.org/wiki/Speech_synthesis