AI Reasoning Archives - Dr Amita Kapoor

AI Brains or Clever Parrots? Unpacking How LLMs ‘Reason’

April 14, 2025

Have you ever noticed how your brain effortlessly maps out the fastest route to work—or zeroes in on the murderer halfway through a mystery novel? That’s reasoning in action: the invisible yet powerful mental machinery we use to draw conclusions, make decisions, and understand the world. Now, imagine machines doing the same thing. In recent …

An In-Depth Look at Group Relative Policy Optimization (GRPO)

February 24, 2025

In recent months, the DeepSeek team has showcased impressive results by fine-tuning large language models for advanced reasoning tasks using an innovative reinforcement learning technique called Group Relative Policy Optimization (GRPO). In this post, we’ll explore the theoretical background and core principles of GRPO while also offering a primer on Reinforcement Learning (RL) and its …