Technology

RAG

RAG (Retrieval-Augmented Generation) is the GenAI framework that grounds LLMs (like GPT-4) on external, verified data, drastically reducing model hallucinations and providing verifiable sources.

RAG is a critical GenAI architecture: it solves the LLM 'hallucination' problem by inserting a retrieval step before generation. A user query is vectorized, then used to query an external knowledge base (e.g., a Pinecone vector database) for relevant document chunks (typically 512-token segments). These retrieved facts augment the original prompt, providing the LLM (e.g., Gemini or Llama 3) the specific, current, or proprietary context required. This process ensures the final response is accurate and grounded in domain-specific data, avoiding the high cost and latency of full model retraining.

https://en.wikipedia.org/wiki/Retrieval-augmented_generation

254 projects · 61 cities

Related technologies

OpenAI API 500 GPT-4 678 Python 739 LangChain 439 GPT-3 390 React 260 BERT 186 Llama-2 337 FastAPI 159 RoBERTa 118 BLOOM 116 Next 197 PaLM 2 117 Gemini 254 Supabase 93 Docker 157 PyTorch 264 PostgreSQL 144

Recent Talks & Demos

Showing 41-64 of 254

Members-Only

Sign in to see who built these projects

Sign in View FAQ

Observability for Reliable AI Agents

OpenAI API Anthropic API

AI Job Agent: Vertex AI + Actions

text-bison Vertex AI

SuperLango: Personalized Vocabulary Engine

LangChain React Native

Gatewayz: Multi-LLM Routing and Cost

Montreal Jan 21

Claude Code OpenAI API

Infinite Wiki: Cooperative World Building

RAG Knowledge Graphs

RAG Authorization for Sensitive Data

Chroma LangChain

PRESENT: Voice Steward Architecture

CLU: Multi-Agent Orchestration

CLU GRID Framework

DARIA: Multi-modal Assessment Pipeline

MindServe AI: GPU Vision and RAG

New York City Dec 9

YOLOv8 Pinecone

Neo4j: StrangerGraphs and GraphRAG

Neo4j Graph database

Empathetic Development: AI Personas Validate

OpenAI Dashboard: Data Insights

GPT-4 LangChain

Number Theory: AI, Crypto, Optimization

Python Apache Kafka

Deepgram OpenAI ElevenLabs Production

Tally: Ambient AI Continuous Memory

Arbiter: Zero-Instrumentation LLM Costs

San Francisco Nov 20

OpenAI SDK Gemini

XCT v2: Idea to Previz Pipeline

Los Angeles Nov 20

Flowise OpenAI API

Youz: AI Landmark Guide

Nürnberg Nov 20

Semantic Caching for Agent Systems

New York City Nov 17

CityPulse: Multi-Modal Video Understanding

New York City Nov 17

Groq Llama 3 Google

Crawling to RAG AI Assistants

OpenAI API pgvector