Whisper Projects

Technology

Whisper

Whisper: OpenAI's robust, open-source ASR model for multilingual speech recognition, translation, and language identification.

Whisper is OpenAI's general-purpose Automatic Speech Recognition (ASR) model, trained on a massive, diverse dataset for high-accuracy performance. It functions as a powerful multitasking system: handling multilingual transcription, direct speech translation, and language identification. The architecture processes audio in a sliding 30-second window, performing autoregressive predictions. Developers can select from six distinct model sizes to optimize for specific speed versus accuracy tradeoffs: this is the go-to solution for reliable, large-scale audio processing.

https://github.com/openai/whisper

25 projects · 18 cities

Related technologies

GPT-4 528 OpenAI API 509 FastAPI 160 FFmpeg 14 Gemini 178 BERT 179 BLOOM 115 GPT-3 191 GPT-5 25 LiveKit 10 llama 40 Llama-2 227 Next 170 Ollama 71 OpenAI 103 PaLM 2 116 PostgreSQL 94 Python 618

Recent Talks & Demos

Showing 1-24 of 25

Members-Only

Sign in to see who built these projects

Sign in View FAQ

On-premise AI solution for Cloud PBX provider

Valencia Apr 21

Shablon: Programmatic Video Templates

Budget by Chatting: Building a multi-channel AI-powered expense track…

Upstate NY Mar 10

Mebot AI: Multimodal Digital Twins

OpenAI API Gemini

Fluo aka we have Duolingo at home

FastAPI SQLAlchemy

LLM Nutrition Pipeline Architecture

VibeCoding Workflow Demo

Claude Code Gemini CLI

Zulu.cash: Private Local AI Agent

Tally: Ambient AI Continuous Memory

Readback: AI ATC Training

Local Transcription

Valencia Oct 22

ORION: ROS 2 ESP32 Robot

ROS 2 Jazzy micro-ROS

Rafiki AI Tutor

Real-Time Voice Agents

llama OpenAI API

Transcriber R&D project

San Francisco Feb 27

Instantcasts: Fast Whisper Transcripts

San Francisco Jan 29

Real-Time LLaMA Voice Assistant

Medellín Dec 5

Automated Video Editing with LLMs

Amsterdam Nov 12

Whisper Clipboard

Amsterdam Sep 25

YT shorts finder

Vision models Whisper

Bengaluru Aug 8

GPT-4 OpenAI API

GGML ONNX Runtime

Whisper/VAD Multi-Model Segmentation

Whisper WebRTC-VAD

Capsule Transcriber: ML Transcription

Los Angeles Mar 19

Whisper WebRTC-VAD