
Technology

Transformer

The Transformer is a neural network architecture that uses a multi-head self-attention mechanism to process all positions of a sequence in parallel, replacing the recurrent (RNN) and convolutional (CNN) layers used in earlier sequence models.

The Transformer architecture, introduced in the landmark 2017 paper "Attention Is All You Need" by Vaswani et al. (Google), revolutionized sequence-to-sequence modeling. It relies entirely on multi-head self-attention, eliminating the sequential processing required by Recurrent Neural Networks (RNNs). Because every position in a sequence can be attended to at once, training parallelizes massively, drastically reducing training time and enabling models to scale to billions of parameters. The Transformer is the foundational architecture for modern Large Language Models (LLMs), including BERT and the Generative Pre-trained Transformer (GPT) series, and drives state-of-the-art performance across Natural Language Processing (NLP) and computer vision tasks.
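For readers who want to see the mechanism concretely, here is a minimal PyTorch sketch of scaled dot-product attention and a multi-head self-attention layer. The class and helper names are illustrative, not from the paper; d_model=512 and n_heads=8 mirror the paper's base configuration.

```python
import torch
import torch.nn.functional as F

def scaled_dot_product_attention(q, k, v):
    # Attention(Q, K, V) = softmax(Q K^T / sqrt(d_k)) V, computed for every
    # query position in a single matrix multiply; this is the source of the
    # parallelism that RNNs lack.
    d_k = q.size(-1)
    scores = q @ k.transpose(-2, -1) / d_k ** 0.5  # (batch, heads, seq, seq)
    return F.softmax(scores, dim=-1) @ v           # weighted sum of values

class MultiHeadSelfAttention(torch.nn.Module):
    def __init__(self, d_model=512, n_heads=8):
        super().__init__()
        assert d_model % n_heads == 0
        self.n_heads, self.d_head = n_heads, d_model // n_heads
        self.qkv = torch.nn.Linear(d_model, 3 * d_model)  # fused Q, K, V projections
        self.out = torch.nn.Linear(d_model, d_model)

    def forward(self, x):  # x: (batch, seq, d_model)
        b, s, _ = x.shape
        q, k, v = self.qkv(x).chunk(3, dim=-1)
        # Split d_model into n_heads independent heads, attend in each head
        # in parallel, then concatenate the heads and re-project.
        split = lambda t: t.view(b, s, self.n_heads, self.d_head).transpose(1, 2)
        out = scaled_dot_product_attention(split(q), split(k), split(v))
        return self.out(out.transpose(1, 2).reshape(b, s, -1))

x = torch.randn(2, 10, 512)               # 2 sequences of 10 tokens each
print(MultiHeadSelfAttention()(x).shape)  # torch.Size([2, 10, 512])
```

Note that nothing in the forward pass iterates over sequence positions, which is what allows the parallel training described above.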

https://arxiv.org/abs/1706.03762
64 projects · 40 cities

Recent Talks & Demos

Showing 1-24 of 64

Project · Location, Date · Technologies

Resource Optimization for LLMs · Bogotá · Transformers PEFT
WhichBox: Multi-modal Vision Search · Raleigh, May 6 · OpenAI Azure Foundry
SAM: Portable ONNX/C++ Implementation · Lausanne, Apr 30 · SAM2 ONNX Runtime
Eric Chat: Local Mac AI · Ottawa, Apr 25 · Eric Transformer MLX-LM
Nanochat: Train LLMs from Scratch · Brussels, Apr 1 · Python Torch
LogAnalyzer: LLM Log Anomaly Detection · Manizales, Mar 25 · FastAPI Vue 3
UofT: AI Job Discovery Engine · Toronto, Mar 25 · FastAPI PostgreSQL
Niuwn AI: Personal AI Twins · Bremen, Mar 25 · Python FastAPI
Words to World: AI Models · San Diego, Feb 26 · Unreal Engine 5 PyTorch
Reality Check: Personal Fact-Checking · Tokyo, Feb 19 · Claude Code OpenAI Codex
Hugging Face RAG: Reduce Hallucinations · Tiruchirappalli, Jan 31 · Transformers RAG
JobsYo: Multi-Model Job Search AI · Toronto, Jan 29 · GPT-5 Gemini 3
Transformers Detect Netflow Anomalies · Toronto, Jan 29 · Python Transformers
Diagnosable ColBERT: Debugging Vector Search · Brussels, Dec 17 · GPT-4 Claude-3
SLM Fine-tuning on 16GB CPU · Waterloo, Dec 15 · LangChain Transformers
AI-First Clinical Trials EDC · Chicago, Dec 9 · React Spring Boot
fastworkflow: SOTA with Small Models · Houston, Dec 9 · GPT-4 Claude-3
Attention Context Memory Unlocking · Portland, Dec 3 · Transformer SSM
Number Theory: AI, Crypto, Optimization · Boston, Dec 2 · Python Apache Kafka
Arbiter: Zero-Instrumentation LLM Costs · San Francisco, Nov 20 · OpenAI SDK Gemini
Constrained Decoding: LLM Pixel Art · Montreal, Nov 20 · Modal Transformers
Sentence Transformers: Content Categorization · Nürnberg, Nov 20 · GPT-4 LangChain
CityPulse: Multi-Modal Video Understanding · New York City, Nov 17 · Ollama FastAPI
Finetuning SLMs for Agents · Amsterdam, Nov 11 · Distill Labs Transformers