.

Technology

RAG

RAG (Retrieval-Augmented Generation) is the GenAI framework that grounds LLMs (like GPT-4) on external, verified data, drastically reducing model hallucinations and providing verifiable sources.

RAG is a critical GenAI architecture: it solves the LLM 'hallucination' problem by inserting a retrieval step before generation. A user query is vectorized, then used to query an external knowledge base (e.g., a Pinecone vector database) for relevant document chunks (typically 512-token segments). These retrieved facts augment the original prompt, providing the LLM (e.g., Gemini or Llama 3) the specific, current, or proprietary context required. This process ensures the final response is accurate and grounded in domain-specific data, avoiding the high cost and latency of full model retraining.

https://en.wikipedia.org/wiki/Retrieval-augmented_generation
254 projects · 61 cities

Related technologies

Recent Talks & Demos

Showing 41-64 of 254

Members-Only

Sign in to see who built these projects

Observability for Reliable AI Agents
Toronto Jan 29
OpenAI API Anthropic API
AI Job Agent: Vertex AI + Actions
Atlanta Jan 27
text-bison Vertex AI
SuperLango: Personalized Vocabulary Engine
Poland Jan 27
LangChain React Native
Gatewayz: Multi-LLM Routing and Cost
Montreal Jan 21
Claude Code OpenAI API
Infinite Wiki: Cooperative World Building
Cologne Jan 21
RAG Knowledge Graphs
RAG Authorization for Sensitive Data
Tokyo Jan 15
Chroma LangChain
PRESENT: Voice Steward Architecture
Seattle Dec 18
Next LiveKit
CLU: Multi-Agent Orchestration
Miami Dec 18
CLU GRID Framework
DARIA: Multi-modal Assessment Pipeline
Raleigh Dec 10
React Python
MindServe AI: GPU Vision and RAG
New York City Dec 9
YOLOv8 Pinecone
Neo4j: StrangerGraphs and GraphRAG
Paris Dec 9
Neo4j Graph database
Empathetic Development: AI Personas Validate
Seattle Dec 8
Gemini React
Scholé AI
Lausanne Dec 3
RAG MultiModN
OpenAI Dashboard: Data Insights
Boston Dec 2
GPT-4 LangChain
Number Theory: AI, Crypto, Optimization
Boston Dec 2
Python Apache Kafka
Deepgram OpenAI ElevenLabs Production
Bogotá Nov 27
GPT-4 Deepgram
Tally: Ambient AI Continuous Memory
London Nov 25
Whisper Gemini
Arbiter: Zero-Instrumentation LLM Costs
San Francisco Nov 20
OpenAI SDK Gemini
XCT v2: Idea to Previz Pipeline
Los Angeles Nov 20
Flowise OpenAI API
Youz: AI Landmark Guide
Nürnberg Nov 20
GPT-4 Claude-3
Semantic Caching for Agent Systems
New York City Nov 17
RAG AGENTS
CityPulse: Multi-Modal Video Understanding
New York City Nov 17
Ollama FastAPI
Dr Auntie
Dubai Nov 15
Groq Llama 3 Google
Crawling to RAG AI Assistants
Dubai Nov 15
OpenAI API pgvector