.

Technology

GPT-2

GPT-2 is a 1.5 billion-parameter, transformer-based language model from OpenAI (2019), trained on 40GB of internet text (WebText) to predict the next word, demonstrating strong zero-shot performance across diverse tasks.

GPT-2 (Generative Pre-trained Transformer 2) is a large, transformer-based language model developed by OpenAI, first released in a staged manner starting in 2019. The largest version features 1.5 billion parameters and was trained on a massive 40GB dataset called WebText, sourced from 8 million web pages. Its core objective was simple: predict the next word in a sequence. This simple training goal resulted in a powerful model capable of generating high-quality, coherent conditional synthetic text. Critically, GPT-2 demonstrated a remarkable ability to perform multiple downstream tasks—including summarization, translation, and question answering—in a 'zero-shot' setting, meaning it required no task-specific training data to achieve state-of-the-art results at the time.

https://openai.com/blog/better-language-models-and-their-implications
205 projects · 71 cities

Related technologies

Recent Talks & Demos

Showing 181-204 of 205

Members-Only

Sign in to see who built these projects

Multi-Agent ML LeetCode Generator
Nairobi Apr 9
GPT-4 OpenAI API
Automated Evals for LLM Agents
Amsterdam Apr 2
GPT-4 OpenAI API
AI Product Manager: ADHD Assistant
Los Angeles Apr 1
GPT-4 LangChain
AI Agent for DIAN Compliance
Medellín Apr 1
ChatGPT Node
HandIt.ai: Self-Improving AI Systems
Medellín Apr 1
GPT-4 Claude-3
LLM CV Scoring and Analysis
Medellín Apr 1
GPT-4o n8n
Influencer Marketing: Agent Automation
Lausanne Apr 1
ChatGPT Claude
AI Game Master: Bootstrap Success
Seattle Mar 27
ChatGPT DALL-E
Python LLM Streaming Workflow
Boston Mar 24
GPT-4o Claude-3
Docs2ai: Copier AI Automation
Guatemala City Mar 24
LangChain Azure Document AI
Mobile Agent: Vision Operator
Delhi Mar 22
GPT-4o OmniParser v2
SirPlotsALot
Montreal Mar 12
GPT Claude
Orchestrating AI Agents for Building Performance: LangGraph, RAG & Re…
Montreal Mar 12
LangChain LangGraph
Aigent Z: iQube Agent Orchestration
New York City Mar 4
LangChain DB-GPT
Jammy: AI Mood Music
New York City Mar 4
Hume AI EVI 2 GPT-4o
Semantic Web Scraping
Rome Mar 3
GPT-4 LangChain
Self Attention
Prague Feb 25
ChatGPT Streamlit
Agent Laboratory: AI Research Agents
Prague Feb 25
Python GPT-4o
Multimodal AI: Analytical Comparison
Boston Feb 24
GPT-4o Claude-3
AI Startup Scout
Seattle Feb 21
GPT-4o Streamlit
Ollama Groq Local Inference
Manizales Jan 22
Llama-2 Mistral
Building GPT-2 LLMs from scratch
Dubai Jan 5
GPT-2 GPT models
Spreadsheets: Browser GPT-2 Implementation
Seattle Dec 11
GPT-2 JavaScript
GraphRAG: Improving RAG Accuracy
Bogotá Oct 30
RAG GraphRAG