NLP Archives - Dr Amita Kapoor

An In-Depth Look at Group Relative Policy Optimization (GRPO)

February 24, 2025

In recent months, the DeepSeek team has showcased impressive results by fine-tuning large language models for advanced reasoning tasks using an innovative reinforcement learning technique called Group Relative Policy Optimization (GRPO). In this post, we’ll explore the theoretical background and core principles of GRPO while also offering a primer on Reinforcement Learning (RL) and its …

Elevating Large Language Models with Retrieval-Augmented Generation

March 4, 2025February 23, 2025

Dive deep into Retrieval-Augmented Generation (RAG) models. Understand their architecture, implementation with LlamaIndex, and diverse applications Picture this: a computer that can write like a person, but with the superpower of reading through a whole library in seconds to give the best answers possible. Welcome to the world of Retrieval-Augmented Generation or RAG for short. …