GPT-2
GPT-2 is a 1.5 billion-parameter, transformer-based language model from OpenAI (2019), trained on 40GB of internet text (WebText) to predict the next word, demonstrating strong zero-shot performance across diverse tasks.
GPT-2 (Generative Pre-trained Transformer 2) is a large, transformer-based language model developed by OpenAI and released in stages over the course of 2019. The largest version has 1.5 billion parameters and was trained on WebText, a 40GB dataset sourced from roughly 8 million web pages. Its training objective was simple: predict the next word in a sequence. This simple goal produced a model capable of generating coherent, high-quality synthetic text conditioned on an input prompt. Critically, GPT-2 could also perform multiple downstream tasks, including summarization, translation, and question answering, in a 'zero-shot' setting, meaning it required no task-specific training data, and in that setting it achieved state-of-the-art results on most language modeling benchmarks of the time.
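To make the next-word-prediction objective concrete, below is a minimal sketch of loading the publicly released GPT-2 weights and generating a continuation of a prompt. It assumes the Hugging Face Transformers library and the small "gpt2" checkpoint (about 124M parameters, not the 1.5B model described above); neither is part of the original OpenAI release, which shipped its own TensorFlow code.

```python
# Minimal sketch: generate text with GPT-2 via Hugging Face Transformers.
# Assumption: the "gpt2" checkpoint here is the small 124M-parameter model.
from transformers import GPT2LMHeadModel, GPT2TokenizerFast

tokenizer = GPT2TokenizerFast.from_pretrained("gpt2")
model = GPT2LMHeadModel.from_pretrained("gpt2")

prompt = "GPT-2 is a transformer-based language model that"
inputs = tokenizer(prompt, return_tensors="pt")

# Generation simply repeats the training objective: predict the next token,
# append it to the sequence, and continue until the length limit is reached.
output_ids = model.generate(
    **inputs,
    max_new_tokens=40,
    do_sample=True,
    top_k=50,
    pad_token_id=tokenizer.eos_token_id,
)
print(tokenizer.decode(output_ids[0], skip_special_tokens=True))
```

Zero-shot use of the same model amounts to phrasing a task as a prompt (for example, appending "TL;DR:" to an article to elicit a summary) rather than fine-tuning on task-specific data.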