Technology

RoBERTa

RoBERTa (Robustly Optimized BERT Pretraining Approach) is a high-performance language model from Facebook AI that significantly outperforms BERT by optimizing the pretraining strategy, not the core architecture.

RoBERTa is a robustly optimized version of the BERT model, developed by researchers at Facebook AI in 2019. The team conducted a replication study, proving BERT was undertrained and could achieve state-of-the-art results with a refined recipe: they removed the Next Sentence Prediction (NSP) objective, implemented dynamic masking, and scaled up training dramatically. Specifically, RoBERTa trained for 500K steps (up from 100K) on a massive 160GB of text data (ten times BERT’s data) using much larger batch sizes (up to 8K). This optimized approach yielded superior performance on major benchmarks like GLUE, RACE, and SQuAD, establishing RoBERTa as a benchmark for subsequent language model development.

https://arxiv.org/abs/1907.11692

118 projects · 40 cities

Related technologies

BERT 186 GPT-3 390 GPT-4 678 BLOOM 116 Llama-2 337 PaLM 2 117 RAG 253 scikit-learn 86 TensorFlow 97 Keras 76 ONNX 87 PyTorch 264 Python 739 Prompt Engineering 42 Generative AI 52 Fine-tuning 40 AI agents 44 ChatGPT 96

Recent Talks & Demos

Showing 81-104 of 118

Members-Only

Sign in to see who built these projects

Sign in View FAQ

LLMs Generate Knowledge Graphs

Knowledge Graphs Structured Networks

GPT-4 Chain-of-Thought

LLM Knowledge Graph Generation

New York City Jun 4

GPT-4 Knowledge Graphs

Reliable AI Agents

New York City May 22

Twitter '95 AI Social Simulation

New York City May 22

Synthasaizer: LLMs and Time Travel

Los Angeles May 21

GPT-4 synthasaizer

Obsidian: Non-Linear AI Chat

ChatGPT Obsidian

LLMs Analyze Qualitative Data

GPT-4 Topic Modeling

Vocal Docs: Talkable Document Editor

GPT-4 Transcription

RAG Copilot for Big Data extraction

Medellín Apr 25

CrustyCrab: LLM C-to-Rust Translator

New York City Apr 24

LLM Language Learning App

Libra: AI Legal Argument Analysis

GPT-4 Prompt Engineering

Viewpoint.AI: Accelerating Group Decisions

Eastside Entrepreneurs Mar 7

GPT-4 Neural networks

LLM Causal Modeling for Social Science

Pokemon LLM Battle Calculations

GPT-4 Fine-tuning

AI Filter: Local LLM Filtering

Chrome extension Twitter

Los Angeles Feb 8

GPT-4 Generative AI

OpenAI Tools and Extraction

Los Angeles Feb 8

Pydantic OpenAI API

EidOS: Agent Operating System

EidOS Kubernetes

nutritionGPT: LLM Nutrition Pipeline

nutritionGPT GPT-4

dstack: GPU Workloads on Any Cloud

midpage: Fixing Vector Search Context

Vector Search Embeddings

Akkadian Oracle

Los Angeles Jan 10