If you're brand new to RAG, start with RAG Explained and RAG Architecture. This post is for the point where you've built a RAG pipeline that mostly works… and now you're paying for it in cost,...
Thursday, 08 January 2026 19:00
//
18 minute read
This is Part 5 of the DocSummarizer series, and it's also the culmination of the GraphRAG series and Semantic Search series. We're combining everything into a deployable web application.
🚨🚨 PREVIEW...
Thursday, 01 January 2026 18:00
//
9 minute read
NuGet
npm
.NET
Node.js
This is Part 4 of the DocSummarizer series. See Part 1 for the architecture, Part 2 for the CLI tool, or Part 3 for the deep dive on embeddings.
The hard part of RAG isn't the...
Tuesday, 30 December 2025 17:00
//
16 minute read
Your RAG system is great at "needle" questions: retrieve a few relevant chunks and synthesise an answer. It struggles with two common query types:
Sensemaking: "What are the main themes across this...
Friday, 26 December 2025 12:00
//
21 minute read
📖 Part of the RAG Series: This is Part 4a - core implementation:
Part 1: RAG Origins and Fundamentals - What embeddings are, why they matter
Part 2: RAG Architecture and Internals - Chunking,...
Tuesday, 25 November 2025 11:00
//
23 minute read
📖 Part of the RAG Series: This is Part 4b - search features and UI:
Part 1: RAG Origins and Fundamentals - What embeddings are, why they matter
Part 2: RAG Architecture and Internals - Chunking,...
Tuesday, 25 November 2025 09:00
//
15 minute read
📖 Related to the RAG Series: This article provides a deep dive into Qdrant, the vector database used in:
Part 4: ONNX & Qdrant Implementation - Building semantic search
Part 5: Hybrid Search &...
Sunday, 23 November 2025 13:00
//
9 minute read
📖 Part of the RAG Series: This is Part 5 - production integration patterns:
Part 1: RAG Origins and Fundamentals - What embeddings are, why they matter
Part 2: RAG Architecture and Internals -...
Saturday, 22 November 2025 12:00
//
9 minute read
In Part 1 of this RAG series, we covered the fundamentals of Retrieval-Augmented Generation—what it is, how it works, and the underlying technology (embeddings, vector databases, LLM internals). Now...
Saturday, 22 November 2025 10:00
//
20 minute read
In Part 1 and Part 2 of this series, we covered RAG's origins, fundamentals, and technical architecture. You understand what RAG is, why it matters, and how it works under the hood. Now it's time to...
Saturday, 22 November 2025 10:00
//
21 minute read