LLMs: Data Extraction Automation
Demonstrating how large language models automate ETL tasks, extracting structured values from unstructured text with code examples ranging from basics to advanced techniques.
ETL and data extraction are among the most useful generalized tasks LLMs can perform. This coding demo walks through the basic and advanced capabilities of generative AI for explicit data extraction.
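To make the extraction workflow concrete, here is a minimal sketch of the pattern: prompt an LLM to return JSON matching a fixed schema, then parse and type-check the response. The `Invoice` schema, the prompt wording, and the stubbed `call_llm` function are illustrative assumptions, not the demo's actual code; a real pipeline would replace the stub with a chat-completion API call.

```python
import json
from dataclasses import dataclass

# Hypothetical target schema for structured extraction.
@dataclass
class Invoice:
    vendor: str
    total: float
    currency: str

EXTRACTION_PROMPT = """Extract the vendor name, total amount, and currency
from the text below. Respond with JSON only, using the keys
"vendor", "total", and "currency".

Text: {text}"""

def call_llm(prompt: str) -> str:
    # Placeholder for a real LLM call (e.g. an OpenAI or Llama chat
    # completion). A canned response keeps this sketch runnable
    # without API access.
    return '{"vendor": "Acme Corp", "total": 1299.5, "currency": "USD"}'

def extract_invoice(text: str) -> Invoice:
    raw = call_llm(EXTRACTION_PROMPT.format(text=text))
    data = json.loads(raw)
    # Coerce and type-check fields before building the record; a real
    # pipeline would also retry or re-prompt on malformed JSON.
    return Invoice(vendor=str(data["vendor"]),
                   total=float(data["total"]),
                   currency=str(data["currency"]))

invoice = extract_invoice("Acme Corp billed us $1,299.50 on 2024-03-01.")
print(invoice)
```

The key design choice is validating the model's output against an explicit schema rather than trusting free-form text, which is what makes LLM extraction reliable enough for ETL pipelines.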
- LLMs: Large Language Models (LLMs) are advanced deep learning models, typically Generative Pre-trained Transformers (GPTs), designed to process and generate human-like text. Trained on vast, multi-trillion-token corpora with billions of parameters, they learn complex linguistic patterns (syntax, semantics). This scale enables emergent capabilities: few-shot learning, code generation, and complex reasoning. Key examples include OpenAI's GPT-4, Google's Gemini, and Meta's Llama 3. LLMs power applications from conversational AI (ChatGPT) to automated content creation, fundamentally shifting how machines handle unstructured language.
- Generative AI: Generative AI is a deep learning paradigm focused on *creating* new output, not just classifying data. It employs foundation models such as LLMs to produce novel, complex content (text, images, code, and audio) from simple user prompts. Key models like OpenAI's GPT-4 and Stability AI's Stable Diffusion, built with billions of parameters and trained on massive datasets, learn patterns complex enough to generate high-quality original content: drafting software code, summarizing 50-page reports, or producing photorealistic images in seconds. It fundamentally shifts human-computer interaction from command-based to prompt-based creation, driving immediate, high-impact productivity gains across industries.
Related projects
Extracting structured information using LLMs
New York City
Learn how to use OpenAITool for clean schema generation, turn functions into tools, and extract Pydantic models directly…
LLM drives a web browser
New York City
This talk demonstrates an open-source interface that enables large language models to interact with web pages through a…
Extraction: Making Using Tools With OpenAI Clean And Simple
Los Angeles
This talk covers defining and using tools as Pydantic models for validation, leveraging functions as tools, and extracting…
Mapping AI Companies
Boston
Learn how to use LLM embeddings to map AI startups into interactive galaxy visualizations, enabling multidimensional search and…
AI Decision-Making in Low / No Trainable Data Domains
DC
This talk explores using expert-created rules of thumb to guide large language models in specialized domains with little…
Thinking LLMs
Los Angeles
This talk explains how to generate synthetic data for training custom o1 style language models using methods from…