.

Technology

GPT-2

GPT-2 is a 1.5 billion-parameter, transformer-based language model from OpenAI (2019), trained on 40GB of internet text (WebText) to predict the next word, demonstrating strong zero-shot performance across diverse tasks.

GPT-2 (Generative Pre-trained Transformer 2) is a large, transformer-based language model developed by OpenAI, first released in a staged manner starting in 2019. The largest version features 1.5 billion parameters and was trained on a massive 40GB dataset called WebText, sourced from 8 million web pages. Its core objective was simple: predict the next word in a sequence. This simple training goal resulted in a powerful model capable of generating high-quality, coherent conditional synthetic text. Critically, GPT-2 demonstrated a remarkable ability to perform multiple downstream tasks—including summarization, translation, and question answering—in a 'zero-shot' setting, meaning it required no task-specific training data to achieve state-of-the-art results at the time.

https://openai.com/blog/better-language-models-and-their-implications
204 projects · 71 cities

Related technologies

Recent Talks & Demos

Showing 41-64 of 204

Members-Only

Sign in to see who built these projects

LLM Failover Chains and Redis
Nashville Jan 29
GPT-4o Claude
Nihongo Convo: AI Conversation Practice
Nashville Jan 29
gpt-4o-mini gpt-4o-mini-tts
Multi-Model Imposter Game
Nashville Jan 29
Python FastAPI
Mori Solution: Construction RAG Pipeline
Nashville Jan 29
GPT-4 React
JobsYo: Multi-Model Job Search AI
Toronto Jan 29
GPT-5 Gemini 3
Human-ish: LinkedIn AI Detector
Toronto Jan 29
GPTZero GPT5
Consistent Pictogram Generation for AAC
Valencia Jan 29
Flutter Replicate
Zensei: Interpretable Market Regimes
Valencia Jan 29
Claude Codex
NetShow IQ1: AI Business Factory
San Diego Jan 22
Anthropic Claude Google Gemini
Finetuning with Claude Synthetic Data
Cologne Jan 21
Claude GPT-5
Synthesizing Contextual Agentic Assistants
Manchester Nh Jan 20
Vercel AI SDK ChatGPT
JP-TL-Bench: AI Paper Writing
Tokyo Jan 15
Claude Code Opus 4
NetShow IQ1: English Business Factory
Orange County Jan 14
GPT Anthropic Claude
Agent Runner
Seattle Jan 12
GPT-4 Claude 3 Opus
Forge: Multi-Agent Code Fixes
Seattle Jan 12
Claude Opus Python
Apple AI in Shortcuts
Seattle Dec 18
iOS Shortcuts iOS 26
PRESENT: Voice Steward Architecture
Seattle Dec 18
Next LiveKit
VibeVoice Realtime: Mac Metal TTS
Seattle Dec 18
GPT-4 LangChain
ChatGPT App Gotchas
Seattle Dec 18
GPT-4 Claude-3
Chisme: AI Quiz Generator
Waterloo Dec 15
GPT-4o Python
Muse: Playful Voice Coding
Sydney Dec 11
Next TypeScript
Inklu-Connect JobSync
Bremen Dec 10
OpenRouter API GPT-5 Nano
Agentic ChatGPT Apps: MCP UI
Paris Dec 9
Vite TypeScript
Vibe Scaffold: Specs for AI Agents
Seattle Dec 8
OpenAI API GPT-4o