Keynote

Beyond the API: Modern Inference for Modern Workloads

[2026 - Day 2 - Keynote] Practitioners from the companies at the frontier of inference and model development sit down to unpack what application developers need to understand right now: why fine-tuning is quietly resurging under the name "RL," how smart teams are compressing inference costs by shaping smaller specialized models, who owns the model routing problem, and why inference capacity is structurally behind demand — possibly for years. PANELISTS: Tuomas Rintamaki - Research Scientist, NVIDIA Charles Zedlewski - Chief Product Officer, Together AI Charles Frye - Member of Technical Staff, Modal Bryan Bischof - Head of AI, Theory Ventures (Moderator)

Speakers