Technology

RAG

RAG (Retrieval-Augmented Generation) is the GenAI framework that grounds LLMs (like GPT-4) on external, verified data, drastically reducing model hallucinations and providing verifiable sources.

RAG is a critical GenAI architecture: it solves the LLM 'hallucination' problem by inserting a retrieval step before generation. A user query is vectorized, then used to query an external knowledge base (e.g., a Pinecone vector database) for relevant document chunks (typically 512-token segments). These retrieved facts augment the original prompt, providing the LLM (e.g., Gemini or Llama 3) the specific, current, or proprietary context required. This process ensures the final response is accurate and grounded in domain-specific data, avoiding the high cost and latency of full model retraining.

https://en.wikipedia.org/wiki/Retrieval-augmented_generation

254 projects · 61 cities

Related technologies

OpenAI API 500 GPT-4 678 Python 739 LangChain 439 GPT-3 390 React 260 BERT 186 Llama-2 337 FastAPI 159 RoBERTa 118 BLOOM 116 Next 197 PaLM 2 117 Gemini 254 Supabase 93 Docker 157 PyTorch 264 PostgreSQL 144

Recent Talks & Demos

Showing 1-24 of 254

Members-Only

Sign in to see who built these projects

Sign in View FAQ

Scaling 780k Page Hybrid Search

Voice AI Expert Interviews

Compose and Dragons: Tiny LLMs

Docker Jan-nano-gguf

One Human Company: Agent Orchestration

Ho Chi Minh City Apr 18

VisionClaw: Autonomous AI Corporation

GPT-4 Claude Opus 4

St Louis Apr 14

Miró: Synthetic Audience Analysis

Manizales Mar 25

MiroFish: Inteligencia de enjambre multi-agente

Manizales Mar 25

GraphRAG Docker

MCP: Knowledge Graph Architecture Consultant

Model Context Protocol (MCP) Claude Code

UofT: Reliable Policy RAG

UofT: AI Job Discovery Engine

FastAPI PostgreSQL

UofT: Intelligent Document Search

Claude Code: Self-Learning AI Agents

Manchester Nh Mar 18

Claude Code Gemini CLI

VLLM and Qdrant: GPU Benchmarking

Manchester Nh Mar 18

NVIDIA Grace-Blackwell: Local AI Supercomputing

Grace-Blackwell DGX Spark

Cloud AI: Elasticity and Scale

Kubernetes Docker

AI WhatsApp Fallas Guide

Valencia Mar 17

EUACC.AI: Fast European Funding

Valencia Mar 17

OpenData.org: Open Entity Graph

Orange County Mar 11

Senzing Relational database

GreenNode: Enterprise AI Agents

Ho Chi Minh City Mar 7

ai-flow.eu: Systematic LLM Testing

Embeddings Beyond RAG

Words to World: AI Models

San Diego Feb 26

Unreal Engine 5 PyTorch

Clasio: 6D Gemini Document Intelligence

San Diego Feb 26

Gemini Cloud Run Gen2