RoBERTa Projects

Technology

RoBERTa

RoBERTa (Robustly Optimized BERT Pretraining Approach) is a high-performance language model from Facebook AI that significantly outperforms BERT by optimizing the pretraining strategy, not the core architecture.

RoBERTa is a robustly optimized version of the BERT model, developed by researchers at Facebook AI in 2019. In a replication study, the team found that BERT was significantly undertrained and could reach state-of-the-art results with a refined recipe: they removed the Next Sentence Prediction (NSP) objective, applied dynamic masking, and scaled up training dramatically. Specifically, RoBERTa was trained with batches of 8K sequences (BERT used 256) on 160GB of text (ten times BERT’s 16GB), and its longest run lasted 500K steps (up from 100K in RoBERTa's own initial configuration). At the time of publication, this recipe gave state-of-the-art results on GLUE, RACE, and SQuAD, establishing RoBERTa as a standard baseline for subsequent language model development.
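To make dynamic masking concrete, here is a minimal sketch in plain Python. It is not the RoBERTa authors' code: the function name `dynamic_mask`, the toy whitespace tokenizer, and the tiny vocabulary are assumptions made for illustration. It follows the masking scheme introduced with BERT and kept by RoBERTa (select about 15% of tokens; replace 80% of those with a mask token, 10% with a random token, and leave 10% unchanged). The point it shows is that the mask is re-sampled every time a sequence is served, rather than fixed once during preprocessing.

```python
import random

MASK = "[MASK]"  # placeholder mask symbol (hypothetical; real tokenizers define their own)


def dynamic_mask(tokens, vocab, rng, mask_prob=0.15):
    """Return (masked_tokens, labels); a fresh mask is sampled on every call.

    labels[i] holds the original token where a prediction is required,
    and None everywhere else.
    """
    masked = list(tokens)
    labels = [None] * len(tokens)
    for i, tok in enumerate(tokens):
        if rng.random() >= mask_prob:
            continue  # position not selected for prediction
        labels[i] = tok
        roll = rng.random()
        if roll < 0.8:
            masked[i] = MASK  # 80%: replace with the mask symbol
        elif roll < 0.9:
            masked[i] = rng.choice(vocab)  # 10%: replace with a random token
        # remaining 10%: keep the original token in place
    return masked, labels


rng = random.Random(0)
tokens = "the quick brown fox jumps over the lazy dog".split()
vocab = sorted(set(tokens))

# Each pass over the same sentence draws a new masking pattern.
for epoch in range(3):
    masked, _ = dynamic_mask(tokens, vocab, rng)
    print(epoch, " ".join(masked))
```

With static masking, the call to `dynamic_mask` would happen once per sentence before training and the result would be reused every epoch; calling it inside the loop is what makes the pattern dynamic.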

https://arxiv.org/abs/1907.11692

118 projects · 40 cities

Related technologies

BERT 179 · GPT-3 191 · GPT-4 528 · BLOOM 115 · Llama-2 227 · PaLM 2 116 · RAG 138 · scikit-learn 82 · TensorFlow 90 · Keras 74 · ONNX 82 · PyTorch 265 · Python 618 · Generative AI 45 · Large Language Models 7 · Prompt Engineering 28 · Fine-tuning 20 · AI agents 35

Recent Talks & Demos

Honcho: Self-Improving Agents

New York City Feb 17

Honcho OpenClaw

AI Stops AWS Cloud Waste

Terraform CloudFormation

Code Quality: Hiding LLM Non-determinism

Brussels Feb 11

Kuralit: Intent-Driven Mobile Interface

Tiruchirappalli Jan 31

AI: Organizational Context Translation

GPT-4 Prompting

Conversation Games: Designing for AI

Temporal DeepRAG Conversational Workflows

Bengaluru Nov 29

Temporal Python

classifai.dev: Self-Improving Classification

Los Angeles Oct 20

Juggernaut Labs: Sustainable AI Systems

Waterloo Oct 20

GPT-4 Cloud Platform

Agentic AI Product Discovery

Singapore Oct 7

GPT-4 LangChain

PaddlePaddle: Structuring Legal Docs

Hong Kong Aug 22

Python PaddleOCR

LLM Dialogue: Memory and TTS Demo

Timee: Automating Real-World Chores

Node Google APIs

AI Decodes East Asian Archives

Hong Kong May 29

End-to-end AI Autograding

Singapore May 21

RAG: Building Advanced Systems

Databricks Agents for Data Engineers

Python Databricks

OncallNinja: AI Oncall Engineer

GPT-4 Google Cloud Platform

GRPO: Rust LLM Training

Los Angeles Feb 24

LLMs and Domain-Specific Languages

GPT-4 AI Dungeon Master

CRM AI Agent for Emails

STRIDE: Automated Development Framework

Transformer Lab

Waterloo Feb 10

Transformer Lab GPT-4