.

Members-Only

Recent Talks & Demos are for members only

Exclusive feed

You must be an AI Tinkerers active member to view these talks and demos.

January 27, 2025 · Boston

LLMs: Data Extraction Automation

Demonstrating how large language models automate ETL tasks, extracting structured values from unstructured text with code examples ranging from basics to advanced techniques.

Overview
Tech stack
  • LLMs
    Large Language Models (LLMs) are Transformer-architecture deep learning systems (e.g., GPT-4, Llama 3) trained on massive text corpora to generate, summarize, and reason over human language at scale.
    LLMs are advanced deep learning models, specifically Generative Pre-trained Transformers (GPTs), designed to process and generate human-like text. They are trained on vast, multi-trillion-token datasets, giving them billions of parameters to learn complex linguistic patterns (syntax, semantics). This scale enables emergent capabilities: few-shot learning, code generation, and complex reasoning. Key examples include OpenAI's GPT-4, Google's Gemini, and Meta's Llama 3. LLMs power applications from conversational AI (ChatGPT) to automated content creation, fundamentally shifting how machines handle unstructured language.
  • Generative AI
    Generative AI employs foundation models (e.g., Large Language Models) to create novel, complex content—text, images, code, and audio—from simple user prompts.
    Generative AI is a deep learning paradigm focused on *creating* new output, not just classifying data. Key models like OpenAI's GPT-4 and Stability AI's Stable Diffusion leverage massive datasets (trillions of parameters) to identify complex patterns. This enables them to generate high-quality, original content: from drafting software code and summarizing 50-page reports to producing photorealistic images in seconds. It fundamentally shifts the human-computer interaction model from command-based to prompt-based creation, driving immediate, high-impact productivity gains across all industries.

Related projects