Architecting • AI Engineering Compendium

Part III: Engineering Intelligent Systems

RAG pipeline, best practices, and agent architectures.

Explore the main stages of RAG.

🗂️

Chunking, embeddings, and vector DB indexing.

🔍

Similarity search finds relevant chunks.

✍️

Augment the prompt and generate grounded answers.

Semantic or heading-aware chunking with modest overlap.

Dense vectors + BM25 for exact terms.

Cross-encoder rerank top-K results.

Use metadata to narrow scope.

Return source URLs/IDs with spans.

Measure precision/recall and faithfulness.