I recently submitted my first research paper. In this post, I share how I prepared it, what challenges I faced, and what I learned from the experience.
This post explores how DeepSeek R1 improves upon DeepSeek R1 Zero by addressing key challenges through strategic fine-tuning and reinforcement learning.