Technology
Evals
10 projects
·
10 cities
Related technologies
Recent Talks & Demos
Showing 1-10 of 10
Optimizing Agent Latency with Evals
San Francisco
Oct 30
GPT-4
LangChain
LLM-Judge: Reliable Immigration AI
New York City
Oct 2
OpenAI API
Pinecone
Evals for Robust NL-to-SQL Agents
Dubai
Aug 23
Python
Anthropic Bedrock
n8n: AI Evals Setup
Hamburg
Aug 14
n8n
RAG
Agentic AI Evaluation
Singapore
Aug 12
LangChain
Langfuse
Patch Party: Live Agent Fixing
London
Jun 25
GPT-4o
Claude-3
LLM Fingerprinting: Model Classification
Toronto
Mar 27
PromptFoo
Ollama
Debate Concierge
Prague
Feb 25
PydanticAI
Tavily
Claude: Finetuning Art Recognition
New York City
Oct 21
Claude
MetGuessr
Configuration-Based LLM Evals
Austin
Jul 11
LLMs
CLI