Members-Only
Recent Talks & Demos are for members only
You must be an AI Tinkerers active member to view these talks and demos.
November 15, 2024
·
Dublin
Nebius: Faster, Cheaper LLMs
Discover how to make inference cheaper and faster using third-party providers like Nebius AI Studio, and see LLM tracing in action. Free credits will be given.
Overview
- Shortly talk about advantages of 3rd party inference providers.
- I will show, how any private inference provider, can be substituted by 3rd party
- Show with Nebius AI Studio Inference
- Showcase LLM tracing
- give away free credits
Links
Nebius offers NVIDIA H100/H200/GB200 GPU clusters via InfiniBand and Kubernetes orchestration.
Scalable LLM inference service offers ultra-low latency via OpenAI-compatible API.
Tech stack