.

Technology

RAG

RAG (Retrieval-Augmented Generation) is the GenAI framework that grounds LLMs (like GPT-4) on external, verified data, drastically reducing model hallucinations and providing verifiable sources.

RAG is a critical GenAI architecture: it solves the LLM 'hallucination' problem by inserting a retrieval step before generation. A user query is vectorized, then used to query an external knowledge base (e.g., a Pinecone vector database) for relevant document chunks (typically 512-token segments). These retrieved facts augment the original prompt, providing the LLM (e.g., Gemini or Llama 3) the specific, current, or proprietary context required. This process ensures the final response is accurate and grounded in domain-specific data, avoiding the high cost and latency of full model retraining.

https://en.wikipedia.org/wiki/Retrieval-augmented_generation
254 projects · 61 cities

Related technologies

Recent Talks & Demos

Showing 1-24 of 254

Members-Only

Sign in to see who built these projects

Scaling 780k Page Hybrid Search
Poland Apr 23
FastAPI vLLM
Voice AI Expert Interviews
Seattle Apr 22
OpenAI Gemini
Compose and Dragons: Tiny LLMs
Paris Apr 21
Docker Jan-nano-gguf
One Human Company: Agent Orchestration
Ho Chi Minh City Apr 18
AI ML
VisionClaw: Autonomous AI Corporation
Chicago Apr 14
GPT-4 Claude Opus 4
Shop Talk
St Louis Apr 14
BLIP CLIP
Miró: Synthetic Audience Analysis
Manizales Mar 25
Python Neo4j
MiroFish: Inteligencia de enjambre multi-agente
Manizales Mar 25
GraphRAG Docker
MCP: Knowledge Graph Architecture Consultant
Denver Mar 25
Model Context Protocol (MCP) Claude Code
UofT: Reliable Policy RAG
Toronto Mar 25
Python RAG
UofT: AI Job Discovery Engine
Toronto Mar 25
FastAPI PostgreSQL
UofT: Intelligent Document Search
Toronto Mar 25
Python FastAPI
Claude Code: Self-Learning AI Agents
Manchester Nh Mar 18
Claude Code Gemini CLI
VLLM and Qdrant: GPU Benchmarking
Manchester Nh Mar 18
vLLM Qdrant
NVIDIA Grace-Blackwell: Local AI Supercomputing
Paris Mar 17
Grace-Blackwell DGX Spark
Cloud AI: Elasticity and Scale
Paris Mar 17
Kubernetes Docker
AI WhatsApp Fallas Guide
Valencia Mar 17
OpenAI API RAG
EUACC.AI: Fast European Funding
Valencia Mar 17
Claude Next
OpenData.org: Open Entity Graph
Orange County Mar 11
Senzing Relational database
GreenNode: Enterprise AI Agents
Ho Chi Minh City Mar 7
RAG LLM
ai-flow.eu: Systematic LLM Testing
Cologne Mar 5
ai-flow Node
Embeddings Beyond RAG
Cologne Mar 5
CLIP RAG
Words to World: AI Models
San Diego Feb 26
Unreal Engine 5 PyTorch
Clasio: 6D Gemini Document Intelligence
San Diego Feb 26
Gemini Cloud Run Gen2