Technology

Google Cloud Speech-to-Text

Google Cloud Speech-to-Text is an API for high-accuracy transcription of audio to text, supporting over 125 languages and variants.

Google Cloud Speech-to-Text converts spoken audio to text using Google’s speech models, including the Chirp 3 foundation model. The service offers three recognition methods: synchronous (for audio under 1 minute), asynchronous (for files up to 480 minutes), and streaming (for real-time applications). It supports over 125 languages and dialects. The API also provides speaker diarization, automatic punctuation, and custom model adaptation to improve accuracy on domain-specific vocabulary.
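The choice among the three recognition methods can be sketched as a small helper. This is an illustrative sketch only: the function name, constants, and return strings are hypothetical and not part of any Google Cloud client library. The thresholds are taken from the description above, and actual service limits may differ.

```python
# Hypothetical helper: picks a recognition method from the limits quoted
# in the description above. Not part of the Google Cloud client library.

SYNC_LIMIT_SECONDS = 60          # synchronous: audio under 1 minute
ASYNC_LIMIT_SECONDS = 480 * 60   # asynchronous: files up to 480 minutes


def choose_recognition_method(duration_seconds: float, realtime: bool = False) -> str:
    """Return 'streaming', 'synchronous', or 'asynchronous' for the given audio."""
    if realtime:
        # Live audio has no known duration up front, so it is streamed.
        return "streaming"
    if duration_seconds < 0:
        raise ValueError("duration_seconds must be non-negative")
    if duration_seconds < SYNC_LIMIT_SECONDS:
        return "synchronous"
    if duration_seconds <= ASYNC_LIMIT_SECONDS:
        return "asynchronous"
    raise ValueError("audio exceeds the 480-minute asynchronous limit")


if __name__ == "__main__":
    print(choose_recognition_method(30))               # synchronous
    print(choose_recognition_method(3600))             # asynchronous
    print(choose_recognition_method(0, realtime=True)) # streaming
```

Audio longer than the asynchronous limit raises an error here; splitting such a file into smaller parts would be one way to handle it.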

https://cloud.google.com/speech-to-text

2 projects · 2 cities

Related technologies

Amazon Transcribe (2) · BERT (179) · CMU Sphinx (2) · GPT-3 (191) · GPT-4 (528) · IBM Watson Speech to Text (2) · Kaldi (2) · Amazon Polly (1) · Azure Speech to Text (1) · BLOOM (115) · Database (8) · DeepSpeech (1) · eSpeak (1) · Festival (1) · Google Cloud Text-to-Speech (1) · IBM Watson Text to Speech (1) · Keras (74) · Llama-2 (227)

Recent Talks & Demos

UzbekVoice

Tashkent Oct 31

Google Cloud Speech-to-Text · Google Cloud Text-to-Speech

YT shorts finder

Austin Sep 12

Vision models · Whisper