Technology
Monitoring
High-dimensional data collection and alerting for cloud-native infrastructure.
Monitoring is the operational heartbeat of any production system. We use tools like Prometheus and Grafana to transform raw metrics into actionable intelligence. By scraping time-series data (CPU usage, request latency, or error rates) every 15 seconds, we gain a granular view of cluster health. This setup allows us to define precise alerting rules (e.g., firing a PagerDuty notification if 95th percentile latency exceeds 300ms) so we can kill bottlenecks before they impact users. It is about moving from reactive guesswork to proactive, data-driven engineering.
15 projects
·
18 cities
Related technologies
Recent Talks & Demos
Showing 1-15 of 15
ibaAgent: Agentic Time-Series Analysis
Nürnberg
Apr 22
LangGraph
OpenAI GPT
Claude Flow V3: Multi-Agent Swarms
Pereira
Mar 25
Claude Flow V3
Claude API
UofT: Intelligent Document Search
Toronto
Mar 25
Python
FastAPI
Archingeo: AI Safety in Infrastructure
Orange County
Jan 14
NVIDIA Jetson
PyTorch
AI-First Clinical Trials EDC
Chicago
Dec 9
React
Spring Boot
Number Theory: AI, Crypto, Optimization
Boston
Dec 2
Python
Apache Kafka
Arbiter: Zero-Instrumentation LLM Costs
San Francisco
Nov 20
OpenAI SDK
Gemini
Luthien Control: Enforcing AI Behavior
Seattle
Oct 22
LiteLLM
Ollama
DIY Wedding Translator: Three Languages
Tokyo
Oct 10
MediaDevices
WebSocket
Luthien Control: Local AI Control
Seattle
Sep 30
Docker
FastAPI
AI in compliance
Pune
Aug 23
GPT-5
Gemini
NVIDIA LLM Router Blueprint
Sydney
Aug 20
Llama 3
Mixtral 8x22B
HackerNews.coffee: Fast Transparent Personalization
London
Jul 16
Vercel
Neon DB
HandIt.ai: Self-Improving AI Systems
Medellín
Apr 1
GPT-4
Claude-3
ResolveML
San Francisco
Jul 6
ResolveML
LLMs