Datasets and Infrastructure for DeepSeek-R1 Style Reinforcement Learning (GRPO)
May 7
16:20 - 17:00
We will walk through everything you need to know about the latest in reinforcement learning for LLMs, datasets and infrastructure, down to training your own small reasoning LLM that can write code locally.