March 20, 2025 · Poland

browser-use: Agent Prompt Structure

Live demo of a low‑cost LLM browser agent signing up for thousands of newsletters, scraping varied sites, and exposing its Langfuse prompt structure and patterns.

Tech stack
  • Python
    Python: The high-level, general-purpose language built for readability, powering everything from web backends to advanced machine learning models.
    Python is a high-level, general-purpose language that prioritizes clear, readable syntax (via significant indentation), which supports rapid development. Its ecosystem is large: use it for web development with frameworks like Django and Flask, or for data science with libraries such as Pandas and NumPy. The Python Package Index (PyPI) hosts hundreds of thousands of community-contributed packages, covering tasks from network programming to GUI creation. The language is stewarded by the Python Software Foundation (PSF), with the stable release currently at Python 3.14.0 (as of November 2025).
  • Gemini Vertex AI
    Vertex AI is the unified, fully-managed platform for building, customizing, and deploying Google's Gemini multimodal models and 200+ foundation models at enterprise scale.
    Gemini on Vertex AI makes Google's multimodal models (e.g., Gemini 3 Pro, 2.5 Flash) available inside enterprise workflows. Use Vertex AI Studio to prototype with text, images, and video, then use the Model Garden to access over 200 proprietary and open-source models. The platform provides an MLOps toolset, including Vertex AI Pipelines and Evaluation, to manage, govern, and scale generative AI applications with Google Cloud's security and data integration (BigQuery) built in.
  • Langfuse
    Langfuse is the open-source LLM engineering platform: gain full observability, manage prompt versions, and run production-grade evaluations.
    Langfuse is an open-source platform for full-lifecycle management of AI applications (agents, chains, etc.). Use its tracing (OpenTelemetry-based) to debug complex, non-deterministic interactions and track cost and latency metrics. It provides prompt management (versioning, A/B testing) and evaluation tools to measure output quality and monitor production health. Integrations are available for LangChain, OpenAI, and LlamaIndex via Python/JS SDKs.
  • Browser
    The client application that fetches, interprets, and renders content (HTML, CSS, JavaScript) from web servers for user display.
    A browser is your primary interface to the World Wide Web, operating on a client-server model: it sends HTTP/HTTPS requests for files and uses a rendering engine to translate the returned data into a visual page. This process involves interpreting HTML structure, applying CSS styles, and executing JavaScript for dynamic functionality. Major players like Google Chrome, Mozilla Firefox, Apple Safari, and Microsoft Edge dominate the market, with Chrome currently holding a significant majority share. The core function remains consistent: transforming raw code into the interactive experience you see on your screen.
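
The "interpreting HTML structure" step described for the browser can be sketched with Python's standard library alone. This is a minimal illustration, not how browser-use or any real rendering engine works: the class name `FormFieldParser`, the helper `parse_page`, and the sample markup are all made up for this example, and no CSS or JavaScript is processed.

```python
from html.parser import HTMLParser


class FormFieldParser(HTMLParser):
    """Hypothetical helper: collects link targets and named form inputs."""

    def __init__(self):
        super().__init__()
        self.links = []   # href values of <a> tags
        self.inputs = []  # (name, type) pairs of named <input> tags

    def handle_starttag(self, tag, attrs):
        attributes = dict(attrs)
        if tag == "a" and "href" in attributes:
            self.links.append(attributes["href"])
        elif tag == "input" and "name" in attributes:
            # Inputs without an explicit type default to "text" in HTML.
            self.inputs.append((attributes["name"], attributes.get("type", "text")))


# Made-up markup standing in for a page returned by a web server.
SAMPLE_HTML = """
<html><body>
  <a href="/about">About</a>
  <form action="/subscribe" method="post">
    <input type="email" name="email">
    <input type="submit" value="Sign up">
  </form>
</body></html>
"""


def parse_page(html):
    """Parse an HTML string and return (links, inputs)."""
    parser = FormFieldParser()
    parser.feed(html)
    parser.close()
    return parser.links, parser.inputs


links, inputs = parse_page(SAMPLE_HTML)
print(links)
print(inputs)
```

The submit button is skipped because it has no `name` attribute, so only the email field is reported as something a sign-up flow would need to fill in.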

Related projects