Nebius. How to quickly make cheaper and faster | Dublin

Members-Only

Recent Talks & Demos are for members only

Exclusive feed

You must be an AI Tinkerers active member to view these talks and demos.

Sign in View FAQ

November 15, 2024 · Dublin

Nebius: Faster, Cheaper LLMs

Discover how to make inference cheaper and faster using third-party providers like Nebius AI Studio, and see LLM tracing in action. Free credits will be given.

Overview

Shortly talk about advantages of 3rd party inference providers.
I will show, how any private inference provider, can be substituted by 3rd party
Show with Nebius AI Studio Inference
Showcase LLM tracing
give away free credits

Links

https://nebius.com
Nebius offers NVIDIA H100/H200/GB200 GPU clusters via InfiniBand and Kubernetes orchestration.
https://nebius.com/services/studio-inference-service
Scalable LLM inference service offers ultra-low latency via OpenAI-compatible API.

Tech stack