Technology

GPT-4o

GPT-4o (omni) is OpenAI's flagship multimodal model: it delivers GPT-4 intelligence with native, real-time processing across text, audio, and vision.

This is GPT-4o, OpenAI’s 'omni' model: a single neural network natively handling text, audio, and image inputs and outputs. It matches GPT-4 performance on English text and code, but surpasses it on non-English language, vision, and audio benchmarks. The speed is a major upgrade: it achieves human-level responsiveness in voice, with an average response time of 0.32 seconds (a significant jump from GPT-4’s 5.4 seconds). Developers get a 128K token context window and a model that is more cost-efficient than its predecessor, making high-intelligence, real-time applications viable.

https://openai.com/index/hello-gpt-4o

72 projects · 40 cities

Related technologies

OpenAI API 500 Python 739 Gemini 254 Claude 383 Next 197 FastAPI 159 GPT-4 678 LangChain 439 OpenAI 340 Claude-3 443 GPT-3 390 gpt-4o-mini 10 React 260 TypeScript 259 Flutter 27 GitHub 151 llama 136 Node 142

Recent Talks & Demos

Showing 61-72 of 72

Members-Only

Sign in to see who built these projects

Sign in View FAQ

Voice-to-Voice AI Building Blocks

San Francisco Aug 21

Aider: AI Pair Programming

Fort Wayne Aug 20

SAP: SOTA Function Calling Reliability

Auto-create Reliable LLM Evals

New York City Jul 24

SciPub+: AI Writing Assistants

Fine-tuning LLMs for Function Calling

Amsterdam Jun 20

GPT-4o function calling

Ultravox: Open Source Speech LM

Low Latency AI Agents

GPT-4o OpenAI API

GPT-4o Snoop Hawk

Auggie: GPT-4o Windows Native App