.

Technology

RoBERTa

RoBERTa (Robustly Optimized BERT Pretraining Approach) is a high-performance language model from Facebook AI that significantly outperforms BERT by optimizing the pretraining strategy, not the core architecture.

RoBERTa is a robustly optimized version of the BERT model, developed by researchers at Facebook AI in 2019. The team conducted a replication study, proving BERT was undertrained and could achieve state-of-the-art results with a refined recipe: they removed the Next Sentence Prediction (NSP) objective, implemented dynamic masking, and scaled up training dramatically. Specifically, RoBERTa trained for 500K steps (up from 100K) on a massive 160GB of text data (ten times BERT’s data) using much larger batch sizes (up to 8K). This optimized approach yielded superior performance on major benchmarks like GLUE, RACE, and SQuAD, establishing RoBERTa as a benchmark for subsequent language model development.

https://arxiv.org/abs/1907.11692
118 projects · 40 cities

Related technologies

Recent Talks & Demos

Showing 1-24 of 118

Members-Only

Sign in to see who built these projects

Honcho: Self-Improving Agents
New York City Feb 17
Honcho OpenClaw
AI Stops AWS Cloud Waste
Raleigh Feb 11
Terraform CloudFormation
Code Quality: Hiding LLM Non-determinism
Brussels Feb 11
Python git
Kuralit: Intent-Driven Mobile Interface
Tiruchirappalli Jan 31
Kuralit iOS
AI: Organizational Context Translation
Atlanta Dec 16
GPT-4 Prompting
Conversation Games: Designing for AI
Chicago Dec 9
Hume AI GPT-4
Temporal DeepRAG Conversational Workflows
Bengaluru Nov 29
Temporal Python
classifai.dev: Self-Improving Classification
Los Angeles Oct 20
GPT-4 CLIP
Juggernaut Labs: Sustainable AI Systems
Waterloo Oct 20
GPT-4 Cloud Platform
Agentic AI Product Discovery
Singapore Oct 7
GPT-4 LangChain
PaddlePaddle: Structuring Legal Docs
Hong Kong Aug 22
Python PaddleOCR
LLM Dialogue: Memory and TTS Demo
Hamburg Aug 14
FastAPI Python
Timee: Automating Real-World Chores
Seattle Jul 24
Node Google APIs
AI Decodes East Asian Archives
Hong Kong May 29
YOLO spaCy
End-to-end AI Autograding
Singapore May 21
Node Express
RAG: Building Advanced Systems
Milan May 8
Python GPT-4
Databricks Agents for Data Engineers
Seattle Apr 24
Python Databricks
OncallNinja: AI Oncall Engineer
Seattle Apr 24
GPT-4 Google Cloud Platform
GRPO: Rust LLM Training
Los Angeles Feb 24
Rust Cargo
LLMs and Domain-Specific Languages
Dublin Feb 24
GPT-4 GPT-3
Wanderheart
Seattle Feb 21
GPT-4 AI Dungeon Master
CRM AI Agent for Emails
Hamburg Feb 20
Docker GPT-4
STRIDE: Automated Development Framework
Austin Feb 13
STRIDE GPT-4
Transformer Lab
Waterloo Feb 10
Transformer Lab GPT-4