

GPT-2

GPT-2 is a 1.5 billion-parameter, transformer-based language model from OpenAI (2019), trained on 40GB of internet text (WebText) to predict the next word, demonstrating strong zero-shot performance across diverse tasks.

GPT-2 (Generative Pre-trained Transformer 2) is a large, transformer-based language model developed by OpenAI and released in stages over the course of 2019. The largest version has 1.5 billion parameters and was trained on WebText, a 40GB dataset of text drawn from 8 million web pages. Its training objective was simple: predict the next word in a sequence, given all the preceding words. Despite that simplicity, the resulting model generates coherent text conditioned on a prompt. GPT-2 also performed several downstream tasks, including summarization, translation, and question answering, in a 'zero-shot' setting, meaning without any task-specific training data. In that setting it achieved state-of-the-art results on several language modeling benchmarks at the time, while its results on the other tasks were promising but generally below those of specialized systems.
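To make the "predict the next word" objective concrete, here is a minimal sketch in Python. It is a toy word-pair (bigram) counter, not GPT-2 and not a transformer: the corpus and the function names (`train_bigram`, `next_word_probs`, `average_nll`, `generate`) are all hypothetical and chosen only for illustration. What it shares with GPT-2 is the shape of the task: estimate a probability for each possible next word, score the model by the average negative log-likelihood of the word that actually came next, and generate text by repeatedly appending a predicted word.

```python
import math
from collections import Counter, defaultdict


def train_bigram(tokens):
    """Count how often each word follows each other word."""
    counts = defaultdict(Counter)
    for prev, nxt in zip(tokens, tokens[1:]):
        counts[prev][nxt] += 1
    return counts


def next_word_probs(counts, prev):
    """Turn the raw follow-up counts for `prev` into probabilities."""
    if prev not in counts:
        return {}
    total = sum(counts[prev].values())
    return {word: c / total for word, c in counts[prev].items()}


def average_nll(counts, tokens):
    """Average negative log-likelihood of each actual next word.

    This is the quantity a next-word model is trained to minimize.
    """
    losses = []
    for prev, nxt in zip(tokens, tokens[1:]):
        p = next_word_probs(counts, prev).get(nxt, 0.0)
        losses.append(-math.log(p) if p > 0 else float("inf"))
    return sum(losses) / len(losses)


def generate(counts, start, max_words):
    """Greedy generation: keep appending the most likely next word."""
    out = [start]
    for _ in range(max_words):
        probs = next_word_probs(counts, out[-1])
        if not probs:  # no known continuation for this word
            break
        out.append(max(probs, key=probs.get))
    return out


# Toy corpus (an assumption for illustration, unrelated to WebText).
corpus = "the cat sat on the mat and the cat slept".split()
model = train_bigram(corpus)

print(next_word_probs(model, "the"))  # "cat" is twice as likely as "mat"
print(generate(model, "the", 1))
print(average_nll(model, corpus))
```

GPT-2 replaces the count table with a transformer network that conditions on the whole preceding context rather than on one word, but the loss it minimizes and the append-one-word-at-a-time generation loop follow the same pattern.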

https://openai.com/blog/better-language-models-and-their-implications
204 projects · 71 cities


Recent Talks & Demos



OpenClaw Agent
Bremen Mar 25
OpenClaw OpenAI ChatGPT
AI-to-USD: Self-Correcting Industrial Scenes
New York City Mar 18
Gemini-2 Flash
metaMe: Dynamic AI Interfaces
New York City Mar 18
GPT-5 Claude Opus
The Midnight Duck: AI Growth
Valencia Mar 17
ChatGPT Gemini
DEIP.app: Digital Ecosystem Intelligence
Valencia Mar 17
Supabase React
SetForMoney: AI Expense Tracking
Upstate NY Mar 10
GPT-5 Whisper
Ghidra: AI Reverse Engineering
Poland Mar 4
Ghidra Claude Code
AI Council: Replacing Consultants
Columbus Mar 2
Gemini ChatGPT
Words to World: AI Models
San Diego Feb 26
Unreal Engine 5 PyTorch
Werewolf Arena: LLM Agent Benchmark
Bogotá Feb 26
GPT-4o Python
Fluo: Adaptive Multi-Agent Language Learning
Toronto Feb 26
FastAPI SQLAlchemy
Meta-Modeling Drug Discovery
DC Feb 24
Python PyTorch
SQAAILab: AI-Augmented QA Workflows
Montreal Feb 24
Claude Opus ChatGPT
Ship-shape: AI Drift
Manchester NH Feb 18
Anthropic Claude Vercel AI SDK
Legala: Beyond the Chat Box
Valencia Feb 17
Claude ChatGPT
Sensora
Conakry Feb 11
GPT-4 Expo
LLM Data Cleaning Production Pipeline
Dhaka Feb 7
Claude GPT-4o
Infera: LangGraph Stock Market Agent
St Louis Feb 4
GPT-4 LangGraph
Public Speaking AI Agent
Tiruchirappalli Jan 31
React TypeScript
OpenCode Local Models on DGX
Seattle Jan 30
OpenCode NVIDIA DGX Spark
LLM Failover Chains and Redis
Nashville Jan 29
GPT-4o Claude
Nihongo Convo: AI Conversation Practice
Nashville Jan 29
gpt-4o-mini gpt-4o-mini-tts
Multi-Model Imposter Game
Nashville Jan 29
Python FastAPI
Mori Solution: Construction RAG Pipeline
Nashville Jan 29
GPT-4 React