

Transformers

The deep learning architecture that revolutionized sequence modeling (NLP, vision) by replacing recurrent units with a parallelizable multi-head self-attention mechanism.

The Transformer is a neural network architecture introduced in the 2017 paper "Attention Is All You Need." It removed the sequential processing bottleneck of earlier Recurrent Neural Networks (RNNs) by dispensing with recurrence and relating the positions of a sequence to one another through self-attention, so all positions can be processed in parallel during training and training runs substantially faster on modern hardware. That efficiency made large-scale pre-trained models practical, including BERT (encoder-only) and the generative GPT series (decoder-only). The architecture now underlies most modern Large Language Models (LLMs) and much of the current state of the art in AI.
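To make the self-attention idea concrete, here is a minimal NumPy sketch of multi-head scaled dot-product self-attention. It is an illustration under stated assumptions, not the reference implementation from the paper: the function names, the random projection matrices, and the toy dimensions are all made up for this example, and layer normalization, residual connections, positional encodings, and the feed-forward sublayer are omitted. The `causal` flag loosely mirrors the encoder-only versus decoder-only distinction: without it every position attends to every other position (as in BERT-style encoders), and with it each position attends only to itself and earlier positions (as in GPT-style decoders).

```python
import numpy as np

def softmax(x, axis=-1):
    # Subtract the row max for numerical stability before exponentiating.
    x = x - x.max(axis=axis, keepdims=True)
    e = np.exp(x)
    return e / e.sum(axis=axis, keepdims=True)

def scaled_dot_product_attention(q, k, v, causal=False):
    # q, k, v: (num_heads, seq_len, d_head)
    d_k = q.shape[-1]
    scores = q @ k.swapaxes(-1, -2) / np.sqrt(d_k)  # (heads, seq, seq)
    if causal:
        seq_len = scores.shape[-1]
        # True above the diagonal marks "future" positions to hide.
        mask = np.triu(np.ones((seq_len, seq_len), dtype=bool), k=1)
        scores = np.where(mask, -np.inf, scores)
    weights = softmax(scores, axis=-1)
    return weights @ v, weights

def multi_head_self_attention(x, w_q, w_k, w_v, w_o, num_heads, causal=False):
    # x: (seq_len, d_model); all weight matrices: (d_model, d_model)
    seq_len, d_model = x.shape
    d_head = d_model // num_heads

    def split(t):
        # (seq, d_model) -> (heads, seq, d_head)
        return t.reshape(seq_len, num_heads, d_head).transpose(1, 0, 2)

    q, k, v = split(x @ w_q), split(x @ w_k), split(x @ w_v)
    heads, weights = scaled_dot_product_attention(q, k, v, causal)
    # Concatenate the heads back into (seq, d_model) and project.
    merged = heads.transpose(1, 0, 2).reshape(seq_len, d_model)
    return merged @ w_o, weights

# Toy usage with hypothetical sizes and random (untrained) weights.
rng = np.random.default_rng(0)
seq_len, d_model, num_heads = 5, 8, 2
x = rng.normal(size=(seq_len, d_model))
w_q, w_k, w_v, w_o = (rng.normal(size=(d_model, d_model)) * 0.1 for _ in range(4))
out, weights = multi_head_self_attention(x, w_q, w_k, w_v, w_o, num_heads, causal=True)
print(out.shape, weights.shape)
```

Note that the score matrix for the whole sequence comes from a single matrix product rather than a step-by-step loop over positions, which is what lets training parallelize where an RNN cannot.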

https://doi.org/10.48550/arXiv.1706.03762
168 projects · 57 cities


Recent Talks & Demos



Arbiter: Zero-Instrumentation LLM Costs
San Francisco Nov 20
OpenAI SDK Gemini
Constrained Decoding: LLM Pixel Art
Montreal Nov 20
Modal Transformers
Sentence Transformers: Content Categorization
Nürnberg Nov 20
GPT-4 LangChain
GPT-5 Ad Campaign Simulator
Boston Nov 17
GPT-4 LangChain
CityPulse: Multi-Modal Video Understanding
New York City Nov 17
Ollama FastAPI
Finetuning SLMs for Agents
Amsterdam Nov 11
Distill Labs Transformers
Scalable Production RAG Architecture
Toronto Nov 10
FAISS OpenAI API
This Is So You! Event Newsletter
Toronto Nov 10
FastAPI PostgreSQL
Instruct Lab LLM Evaluation Playbook
Toronto Nov 10
Merlinite-7B-Lab Mistral Mixtral
CityPulseNYC: Multi-Modal RAG
New York City Nov 6
Ollama LLaMA 3B
Tracking AI code
New York City Nov 6
GPT-4 Llama-2
WikiMem
Minneapolis Saint Paul Nov 5
GPT-4 LangChain
Secure AI Health Assistant with EHR
Dhaka Nov 1
OpenAI API FastAPI
Optimizing Agent Latency with Evals
San Francisco Oct 30
GPT-4 LangChain
RapidFire AI: Parallel LLM Experimentation
San Diego Oct 29
PyTorch Transformers
IA Aplicada: Embeddings y Validación GPT
Santiago Oct 29
GPT-4 LangChain
AI, Humor Chileno e Identidad
Santiago Oct 29
GPT-4 Claude-3
Polytopia: Large Action Space AI
Nashville Oct 28
GPT-4 LangChain
AI Global Admissions Automation
Seattle Oct 22
GPT-4 Claude-3
Loose Control
Hong Kong Oct 22
GPT-4 LangChain
Production AI Agents for Healthcare, Finance
Hong Kong Oct 22
GPT-4 Claude-3
Agentic Loops: Human Intervention Patterns
San Francisco Oct 16
GPT-4 Llama-2