
From Training Language Models to Training DeepSeek-R1
Reasoning Models #1 - An overview of training
You probably already understand the potential of reasoning models. Playing around with o1 or DeepSeek-R1 makes their enormous promise clear. As enthusiasts, we are all curious to build something like them.
We all start on this path, too. However,