.

Technology

Transformers

The deep learning architecture that revolutionized sequence modeling (NLP, vision) by replacing recurrent units with a parallelizable multi-head self-attention mechanism.

The Transformer: a neural network architecture introduced in the landmark 2017 paper, "Attention Is All You Need." It eliminated the sequential processing bottleneck of prior Recurrent Neural Networks (RNNs) by relying solely on self-attention, enabling massive parallelization and significantly faster training (up to 10x faster) on modern hardware. This efficiency allowed for the creation of large-scale pre-trained models: BERT (encoder-only) and the generative GPT series (decoder-only). The architecture is now foundational to all modern Large Language Models (LLMs) and drives the current state-of-the-art in AI.

https://doi.org/10.48550/arXiv.1706.03762
168 projects · 57 cities

Related technologies

Recent Talks & Demos

Showing 41-64 of 168

Members-Only

Sign in to see who built these projects

AI Global Admissions Automation
Seattle Oct 22
GPT-4 Claude-3
Loose Control
Hong Kong Oct 22
GPT-4 LangChain
Production AI Agents for Healthcare, Finance
Hong Kong Oct 22
GPT-4 Claude-3
Agentic Loops: Human Intervention Patterns
San Francisco Oct 16
GPT-4 Llama-2
BERT Fine-tuning on MultiNLI
Houston Oct 14
Claude Code Transformers
KG/LLM Synergy Demo
Tokyo Oct 10
GPT-4 LangChain
CometML: Tracking AI Experiments
Austin Oct 9
GPT-4 LangChain
Full-Precision LLMs on Small Machines
London Oct 7
Transformers Apple MLX
Booplet: Reproducible Tool Callers
Singapore Oct 7
GPT-4 Claude-3
OneNote AI Q&A System
Seattle Sep 30
GPT-4 LangChain
FiftyOne Visual Similarity Search
Raleigh Sep 30
FiftyOne CLIP
Candy Mountain: Agentic Project Automation
Boston Sep 29
GPT-4 Claude-3
Hexagone: Anonymize Data for AI
Paris Sep 18
vLLM Transformers
Hexagone AI: Multimodal Anonymization
Paris Sep 18
React Next
AutoRAG: Specialized AI Datasets
Brisbane Sep 11
Llama-3-8B-Instruct FAISS
fastWorkflow: Deterministic Conversational AI
Houston Sep 9
LiteLLM Transformers
Claude: Agentic bmad development
Seattle Sep 8
GPT-4 Claude
BigQuery GenAI: Advanced SQL Analytics
Miami Sep 3
GPT-4 Gemini
Agentic AI Design and Deployment
Calgary Aug 28
GPT-4 LangChain
Self-builidng game worlds
London Aug 28
GPT-4 OpenAI API
AI Eyes
Dubai Aug 23
GPT-4 LangChain
LLM Safety: Model vs Prompt
Dubai Aug 23
GPT-4 GPT-3
OpenSpec: Spec-Driven AI Development
Sydney Aug 20
GPT-4 Claude
Production LLM Cost Optimization
Orange County Jul 31
Transformers vLLM