.

Technology

Transformers

The deep learning architecture that revolutionized sequence modeling (NLP, vision) by replacing recurrent units with a parallelizable multi-head self-attention mechanism.

The Transformer: a neural network architecture introduced in the landmark 2017 paper, "Attention Is All You Need." It eliminated the sequential processing bottleneck of prior Recurrent Neural Networks (RNNs) by relying solely on self-attention, enabling massive parallelization and significantly faster training (up to 10x faster) on modern hardware. This efficiency allowed for the creation of large-scale pre-trained models: BERT (encoder-only) and the generative GPT series (decoder-only). The architecture is now foundational to all modern Large Language Models (LLMs) and drives the current state-of-the-art in AI.

https://doi.org/10.48550/arXiv.1706.03762
168 projects · 57 cities

Related technologies

Recent Talks & Demos

Showing 1-24 of 168

Members-Only

Sign in to see who built these projects

Optimización de recursos para LLMs
Bogotá
Transformers PEFT
SAM: Portable ONNX/C++ Implementation
Lausanne Apr 30
SAM2 ONNX Runtime
Nanochat: Train LLMs from Scratch
Brussels Apr 1
Python Torch
LogAnalyzer: LLM Log Anomaly Detection
Manizales Mar 25
FastAPI Vue 3
Niuwn AI: Personal AI Twins
Bremen Mar 25
Python FastAPI
Words to World: AI Models
San Diego Feb 26
Unreal Engine 5 PyTorch
Reality Check: Personal Fact-Checking
Tokyo Feb 19
Claude Code OpenAI Codex
Hugging Face RAG: Reduce Hallucinations
Tiruchirappalli Jan 31
Transformers RAG
JobsYo: Multi-Model Job Search AI
Toronto Jan 29
GPT-5 Gemini 3
Transformers Detect Netflow Anomalies
Toronto Jan 29
Python Transformers
Biological Age from Blood Work
Seattle Dec 18
GPT-4 OpenAI API
Diagnosable ColBERT: Debugging Vector Search
Brussels Dec 17
GPT-4 Claude-3
x402-Enabled AI Gateway
Atlanta Dec 16
GPT-4 LangChain
SLM Fine-tuning on 16GB CPU
Waterloo Dec 15
LangChain Transformers
AI-First Clinical Trials EDC
Chicago Dec 9
React Spring Boot
fastworkflow: SOTA with Small Models
Houston Dec 9
GPT-4 Claude-3
Science of Intelligence
Portland Dec 3
GPT-4 LangChain
NotebookLM: Grounded Academic Research
Asuncion Nov 27
GPT-4 LangChain
AI Management: Robotics Safety Standards
Hong Kong Nov 27
GPT-4 LangChain
Paradigm: Understand Legacy Code
Poland Nov 26
GPT-4 LangChain
Arbiter: Zero-Instrumentation LLM Costs
San Francisco Nov 20
OpenAI SDK Gemini
Constrained Decoding: LLM Pixel Art
Montreal Nov 20
Modal Transformers
Sentence Transformers: Content Categorization
Nürnberg Nov 20
GPT-4 LangChain
GPT-5 Ad Campaign Simulator
Boston Nov 17
GPT-4 LangChain