Members-Only
Recent Talks & Demos are for members only
You must be an AI Tinkerers active member to view these talks and demos.
vvvdeo: SAM2 Video Memes
Learn how to use vvvdeo, a lightweight video editor, to segment videos, apply speed changes, and attach images using the SAM2 model via a Python backend.
I’ll be presenting vvvdeo, an early in development video editor website focused on being easy to use, free of cluttering ads, and fast enough to create video memes in seconds (or minutes, since the demo will be local, running inference using the CPU).
Cut and speed up videos with ffmpeg.wasm and an ffmpeg server-side implementation. Segment videos and attach images to specific segments using Meta’s open-source SAM2 model, integrated into a custom Python backend written by me (and Claude).
vvvdeo: ffmpeg-powered video editor with SAM2 segmentation and Go backend.
- GPT-4GPT-4 is OpenAI’s large multimodal model: it processes both text and image inputs, delivering human-level performance on complex professional and academic benchmarks.This is OpenAI’s latest milestone in scaling deep learning: a large multimodal model accepting both text and image inputs. It demonstrates a significant capability leap over its predecessor, scoring in the top 10% on a simulated bar exam (GPT-3.5 scored in the bottom 10%). The model handles nuanced instructions and long-form content, supporting context windows up to 32,768 tokens (32K model). This capacity allows processing up to 25,000 words in a single, complex prompt. GPT-4 is engineered for enhanced reliability, steerability, and advanced reasoning across diverse tasks.
- LangChainThe open-source framework for building and deploying reliable, data-aware Large Language Model (LLM) applications.LangChain is the essential framework for engineering LLM-powered applications: it simplifies connecting models (like GPT-4 or Claude) to external data, computation, and APIs. The platform provides a modular set of components—Chains, Agents, Tools, and Memory—allowing developers to quickly build complex workflows like Retrieval-Augmented Generation (RAG) pipelines and sophisticated conversational agents. Its Python and JavaScript libraries, combined with LangChain Expression Language (LCEL), offer a standardized interface for rapid prototyping and moving applications to production with confidence.
- OpenAI APIOpenAI API: Your direct gateway to cutting-edge AI models (GPT-4o, DALL-E 3, Whisper), enabling scalable, multimodal intelligence integration into any application.The OpenAI API provides authenticated, programmatic access to a powerful suite of generative AI models. Developers leverage REST endpoints and official libraries (Python, Node.js) to integrate capabilities like advanced text generation (GPT-4o), image creation (DALL-E 3), and speech-to-text transcription (Whisper). This platform is engineered for scale, supporting millions of daily requests for tasks from complex reasoning to real-time customer support agents, ensuring your application gets reliable, state-of-the-art intelligence.
- PythonPython: The high-level, general-purpose language built for readability, powering everything from web backends to advanced machine learning models.Python is the high-level, general-purpose language prioritizing clear, readable syntax (via significant indentation), ensuring rapid development for any team . Its ecosystem is massive: use it for robust web development with frameworks like Django and Flask, or leverage its power in data science with libraries such as Pandas and NumPy . The Python Package Index (PyPI) provides thousands of community-contributed modules, offering immediate solutions for tasks from network programming to GUI creation . The language is actively maintained by the Python Software Foundation (PSF), with the stable release currently at Python 3.14.0 (as of November 2025) .
- DockerDocker is the open-source platform that packages applications and dependencies into standardized, portable containers for consistent execution across any environment.Docker is the industry-standard containerization platform, enabling developers to build, ship, and run applications efficiently. It uses the Docker Engine (the core runtime) to create lightweight, isolated environments called containers: these units bundle an application’s code, libraries, and configuration. This self-contained approach guarantees consistency, eliminating the 'it works on my machine' problem across development, testing, and production environments (local workstations, cloud, or on-premises). Docker debuted in 2013 and now serves over 20 million developers monthly, simplifying complex workflows like CI/CD and microservices architecture by leveraging tools like Docker Hub for image sharing and Docker Compose for multi-container applications.
Related projects
Revideo - Create Videos with Code & AI
Berlin
Learn how Revideo uses TypeScript and WebCodecs to render programmatic video in the browser, achieving 70x faster rendering…
Demo - AI Powered, rich video composition in real-time
Bengaluru
This talk demonstrates real-time AI-powered video composition for both short recordings and live streaming, featuring voice anonymity technology.
V2C.dev - Video to Code
Boston
Learn how to turn a video into functional code, projects, and complete websites by extracting and translating visual…
The CheerLabs
Bengaluru
Explore how the GenMaya engine uses AI to compose video and remix audio in real time, simplifying asset…
EVA - Enhanced Video Archive
Raleigh
Learn how to build a fully local RAG system that indexes video transcripts, timestamps, and keyframes, merges context,…
Summarise.live
Amsterdam
Learn how Summarise.live creates concise, accurate summaries of long videos and podcasts, using personalized algorithms to retain key…