✨ Understand What's Happening At The Frontier of Agentic AI - GOSIM Paris 2026 - May 5-6, 2026 ✨
Filter

AI Infra

Datasets and Infrastructure for DeepSeek-R1 Style Reinforcement Learning (GRPO)

May 6

16:20 - 17:00

Location: Founders Café (Updated)

We will walk through everything you need to know about the latest in reinforcement learning for LLMs, datasets and infrastructure, down to training your own small reasoning LLM that can write code locally.

Speakers