Members-Only
Recent Talks & Demos are for members only
You must be an AI Tinkerers active member to view these talks and demos.
RAGBuilder: Optimize RAG Setup
Learn how RAGBuilder automatically optimizes chunking, embedding models, and other RAG parameters, evaluates configurations on test data, and demonstrates its new SDK.
RAGBuilder is an open-source toolkit that helps you create optimal Production-ready RAG (Retrieval-Augmented-Generation) setup for your data automatically. By performing hyperparameter optimization on various RAG parameters (Eg: chunking strategy, chunk size, embedding models etc.), RAGBuilder evaluates these configurations against a test dataset to identify the best-performing setup for your data.
A toolkit optimizing production-ready RAG setups via hyperparameter tuning for data.
- RAGBuilderRAGBuilder is the automated toolkit for hyperparameter tuning and deploying optimal, production-ready Retrieval-Augmented Generation (RAG) pipelines.RAGBuilder automates the complex, time-intensive process of RAG pipeline optimization. The tool efficiently identifies the best configuration for your data by performing hyperparameter tuning across granular parameters: chunking strategy (semantic, character), chunk size (e.g., 1000, 2000), embedding models, and retrievers. This process bypasses the manual testing of thousands of combinations—a toy example shows 78,125 distinct setups—by evaluating configurations against a test dataset for metrics like accuracy and latency. Utilize state-of-the-art, pre-defined RAG templates to accelerate development, ensuring a high-performance, production-grade RAG system is ready in minutes.
- RAGRAG (Retrieval-Augmented Generation) is the GenAI framework that grounds LLMs (like GPT-4) on external, verified data, drastically reducing model hallucinations and providing verifiable sources.RAG is a critical GenAI architecture: it solves the LLM 'hallucination' problem by inserting a retrieval step before generation. A user query is vectorized, then used to query an external knowledge base (e.g., a Pinecone vector database) for relevant document chunks (typically 512-token segments). These retrieved facts augment the original prompt, providing the LLM (e.g., Gemini or Llama 3) the specific, current, or proprietary context required. This process ensures the final response is accurate and grounded in domain-specific data, avoiding the high cost and latency of full model retraining.
- SDKA Software Development Kit (SDK) is a comprehensive package of tools, libraries, and documentation used to build applications for a specific platform or service.The SDK is your essential developer toolkit: it bundles pre-built components (libraries, APIs, code samples) for rapid application development. It allows engineers to integrate complex functionalities—like push notifications or mobile payments—without writing everything from scratch, significantly accelerating time-to-market. For instance, the Android SDK provides the framework for accessing device-specific features (camera, GPS); Apple's Xcode includes the Integrated Development Environment (IDE) for macOS and iOS development; and the AWS SDK enables direct integration with over 200 cloud services (S3, EC2) using multiple languages (Python, Java). An SDK’s core value is simplification: it abstracts away low-level plumbing (authentication, error handling) so your team focuses on feature delivery.
- Generative AIGenerative AI employs foundation models (e.g., Large Language Models) to create novel, complex content—text, images, code, and audio—from simple user prompts.Generative AI is a deep learning paradigm focused on *creating* new output, not just classifying data. Key models like OpenAI's GPT-4 and Stability AI's Stable Diffusion leverage massive datasets (trillions of parameters) to identify complex patterns. This enables them to generate high-quality, original content: from drafting software code and summarizing 50-page reports to producing photorealistic images in seconds. It fundamentally shifts the human-computer interaction model from command-based to prompt-based creation, driving immediate, high-impact productivity gains across all industries.
- Embedding modelsEmbedding models convert complex, high-dimensional data (text, images, audio) into dense, low-dimensional numerical vectors, enabling machines to process semantic meaning and relationships.Embedding models are the core engine for modern AI applications: they transform unstructured data—like a document or an image—into a fixed-length, N-dimensional vector (a list of numbers). This vector is an 'embedding,' a numerical representation where semantic similarity is encoded by spatial proximity (closer vectors mean more related concepts). For example, models like Word2Vec and BERT generate these vectors, typically in dimensions such as 768 or 1536. This process is critical for tasks like semantic search, clustering, and Retrieval-Augmented Generation (RAG), allowing systems to accurately find the most relevant information based on meaning, not just keyword matching.
Related projects
AuraML - Synthetic Data Generation
Bengaluru
Learn how to convert real-world environments into 3D simulations using a cloud platform to generate synthetic vision datasets,…
THub - A Low Code No Code Platform for building GenAI apps
Bengaluru
Discover THub, a low-code platform for building GenAI apps with drag-and-drop ease. Explore LLMs like GPT, Gemini, Claude,…
GPT-RAG Template using Semantic Kernel and Load Testing
Boston
Learn how to implement the GPT‑RAG template with Semantic Kernel, configure zero‑trust security, and perform load testing for…
AutoRAG
Berlin
This talk covers automatic hyperparameter optimization for retrieval augmented generation pipelines, focusing on tuning retrieval settings like chunk…
AI agents to conduct recruiting interviews
Bengaluru
This talk explores building modular AI agents that conduct dynamic, unbiased recruiting interviews using LLMs and cognitive architectures…
RAGistotle - a multi modal RAG
Dubai
Explore RAGistotle, a multi-modal RAG handling text, audio, video, data files, YouTube, and images. See how diverse LLMs…