AI · LLM

A Deep Dive into RAG: What It Actually Does

Gokula Prasanth · Mar 20, 2026 · 11 min read

Retrieval-Augmented Generation is one of the most practical LLM patterns to emerge in the last two years. The idea is simple: instead of relying on a model's parametric memory, you fetch relevant context at query time and stuff it into the prompt.

The pipeline

At a high level: chunk documents, embed chunks, index embeddings, then at query time embed query, find nearest neighbours, inject into prompt, and generate answer. Each of those steps is a place things can go wrong.

0 Comments

CSS Architecture That Does Not Haunt You

Feb 18, 2026

Improving Search with Sentence Embeddings

Mar 5, 2026

Why I Still Choose PHP for Personal Projects

Apr 2, 2026

How I Built a Recommendation Engine from Scratch

Apr 14, 2026

A Deep Dive into RAG: What It Actually Does

The pipeline

0 Comments

Leave a comment