RAG for Implementers: CPU-Friendly Semantic Search with ONNX and Qdrant
📖 Part of the RAG Series: This is Part 4a - core implementation: Part 1: RAG Origins and Fundamentals - What embeddings are, why they matter Part 2: RAG Architecture and Internals - Chunking,...

Tuesday, 25 November 2025 11:00 // 23 minute read

RAG for Implementers: Semantic Search in Action
📖 Part of the RAG Series: This is Part 4b - search features and UI: Part 1: RAG Origins and Fundamentals - What embeddings are, why they matter Part 2: RAG...

Monday, 24 November 2025 20:41 // 15 minute read

Self-Hosted Vector Databases with Qdrant: A Deep Dive
📖 Related to the RAG Series: This article provides a deep dive into Qdrant, the vector database used in: Part 4: ONNX & Qdrant Implementation - Building semantic search Part 5: Hybrid Search &...

Sunday, 23 November 2025 13:00 // 9 minute read

RAG for Implementers: Hybrid Search and Automatic Indexing
📖 Part of the RAG Series: This is Part 5 - production integration patterns: Part 1: RAG Origins and Fundamentals - What embeddings are, why they matter Part 2: RAG Architecture and Internals -...

Saturday, 22 November 2025 12:00 // 8 minute read

RAG in Practice: Building Real-World Applications
In Part 1 of this RAG series, we covered the fundamentals of Retrieval-Augmented Generation—what it is, how it works, and the underlying technology (embeddings, vector databases, LLM internals). Now...

Saturday, 22 November 2025 10:00 // 20 minute read

RAG in Practice: Building Real-World Applications
In Part 1 and Part 2 of this series, we covered RAG's origins, fundamentals, and technical architecture. You understand what RAG is, why it matters, and how it works under the hood. Now it's time to...

Saturday, 22 November 2025 10:00 // 21 minute read

RAG Architecture and Internals: How It Really Works
In Part 1, we covered RAG's origins, fundamentals, and why it matters. You understand the high-level concept: retrieve relevant information, then use it to generate responses. Now we dive deep into...

Saturday, 22 November 2025 09:30 // 24 minute read

RAG Explained: Origins and Fundamentals
Ever searched for "deployment guide" and got nothing, even though there's an article about "publishing to production"? RAG (Retrieval-Augmented Generation) solves this by understanding meaning, not...

Saturday, 22 November 2025 09:00 // 16 minute read

© 2025 Scott Galloway — Unlicense — All content and source code on this site is free to use, copy, modify, and sell.