AI Learn
IntermediateFeatured

Build Production RAG Systems

Ship document Q&A and support bots that customers trust. This course walks you through the full production RAG stack—from ingestion and chunking through embeddings, hybrid retrieval, pipeline assembly, and deployment with eval and cost controls. Every module ends with a hands-on lab you can drop into your own codebase. **What you'll build:** - A chunking pipeline with overlap and metadata - Vector search with cosine similarity - Hybrid retrieval with reranking and MRR evaluation - A complete context assembly layer for LLM prompts - Production monitoring, eval harnesses, and cost dashboards Built for engineers who need RAG that works on real PDFs, wikis, and ticket archives—not toy demos.

28 hours 0 students 5 modules · 25 lessons 5 hands-on exercises

Instructor: Dr. Sarah Chen

Course Curriculum

Module 1: Document Ingestion & Chunking

By the end of this module you'll build a production chunking pipeline with overlap, metadata extraction, and idempotent ingestion.

  • Why Ingestion Makes or Breaks RAG
  • Chunking Strategies & Metadata Design
  • Parsing Pipelines for PDF, HTML & Wikis
  • Idempotency, CDC & Ingestion at Scale
  • Lab: Sliding-Window ChunkerLab
  • Module 1: Ingestion & ChunkingQuiz

Module 2: Embeddings & Vector Search

By the end of this module you'll implement cosine similarity search and understand how to scale vector retrieval to production indexes.

  • Embeddings: From Text to Geometry
  • Vector Indexes & Similarity Search at Scale
  • Embedding Model Benchmarking
  • Multi-Tenant Vector Storage & Normalization
  • Lab: Cosine Similarity SearchLab
  • Module 2: Embeddings & Vector SearchQuiz

Module 3: Retrieval Quality & Hybrid Search

By the end of this module you'll implement MRR evaluation and design hybrid retrieval pipelines that beat pure vector search.

  • Reranking, Hybrid Search & When Vectors Fail
  • Evaluating Retrieval: MRR, Recall@K, and Golden Sets
  • Query Transformation & Retrieval Tuning
  • Filtered Search, ACL & Domain Retrieval
  • Lab: Mean Reciprocal Rank (MRR)Lab
  • Module 3: Retrieval QualityQuiz

Module 4: Building the RAG Pipeline

By the end of this module you'll build a context assembly layer that turns retrieved chunks into reliable LLM prompts.

  • End-to-End RAG Architecture
  • Context Assembly, Citations & Faithfulness
  • Multi-Turn RAG & Query Rewriting
  • Caching, Streaming & Latency Optimization
  • Lab: Context AssemblerLab
  • Module 4: RAG PipelineQuiz

Module 5: Production Deployment & Operations

By the end of this module you'll design eval harnesses, monitoring dashboards, and cost controls for production RAG.

  • Eval Pipelines: Retrieval to Answer Quality
  • Monitoring, Cost Control & Incident Response
  • RAG Security & Document Poisoning
  • Blue-Green Indexing & Re-Index Strategy
  • Lab: RAG Cost EstimatorLab
  • Module 5: Production OperationsQuiz

$94.99

Lifetime access · Certificate included

  • Hands-on coding exercises
  • Module knowledge check quizzes
  • Downloadable resources
  • Progress tracking dashboard