Summary
Tech with Mak presents PageIndex, an open-source alternative to traditional vector-database RAG (Retrieval-Augmented Generation) systems. The core argument challenges a fundamental assumption in modern RAG: that semantic similarity equals relevance. Traditional RAG systems chunk documents into pieces, embed them as vectors, and retrieve by similarity score, an approach that fails dramatically for professional documents where the answer might be buried in an appendix with zero semantic overlap with the user's query. PageIndex, built by VectifyAI and inspired by AlphaGo's tree search algorithm, replaces vector similarity with structured reasoning. It builds a hierarchical tree index (an intelligent table of contents) from documents and lets LLMs reason through that structure the way a human expert would navigate a document. Rather than asking "what text looks similar?", it asks "where would an expert look for this information?"

The system achieves 98.7% accuracy on FinanceBench, dramatically outperforming general-purpose LLMs like Perplexity (45%) and GPT-4o (31%) on the same benchmark. Beyond accuracy metrics, PageIndex eliminates the operational complexity of vector databases: no vector embeddings to generate and maintain, no arbitrary chunking that breaks cross-section context, and, crucially, full traceability to exact page and section references rather than opaque similarity scores. The post positions this as a fundamental rethinking of how retrieval should work in professional contexts, moving from pattern matching to reasoning-based navigation.
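The tree-index-plus-reasoning loop described above can be sketched in a few lines of Python. Everything below is illustrative, not PageIndex's actual API: the node fields, the toy 10-K outline, and especially the keyword-overlap function, which stands in for the LLM call that would normally decide "where would an expert look?".

```python
from dataclasses import dataclass, field

@dataclass
class TocNode:
    """One node of the hierarchical index: a section title, a short
    summary, and the page range the section covers."""
    title: str
    summary: str
    pages: tuple            # (first_page, last_page), 1-indexed
    children: list = field(default_factory=list)

def choose_child(node: TocNode, query: str) -> TocNode:
    """Stand-in for the LLM reasoning step. PageIndex would prompt a
    model to pick the most promising child section; here we fake that
    decision with naive keyword overlap so the sketch runs offline."""
    words = set(query.lower().split())
    def overlap(child: TocNode) -> int:
        text = (child.title + " " + child.summary).lower()
        return sum(w in text for w in words)
    return max(node.children, key=overlap)

def navigate(root: TocNode, query: str):
    """Descend from the root to a leaf, one reasoning step per level,
    recording the path for traceability."""
    node, path = root, [root.title]
    while node.children:
        node = choose_child(node, query)
        path.append(node.title)
    return node, path

# Toy table of contents for an annual report.
report = TocNode("2024 Annual Report", "full 10-K filing", (1, 180), [
    TocNode("Management Discussion", "revenue trends, margins, outlook", (5, 40)),
    TocNode("Financial Statements",
            "balance sheet, income statement, notes covering debt", (41, 160), [
        TocNode("Balance Sheet", "assets, liabilities, shareholders' equity", (41, 44)),
        TocNode("Notes to Financial Statements",
                "debt maturities, lease commitments, contingencies", (60, 160)),
    ]),
])

leaf, path = navigate(report, "schedule of long-term debt maturities")
print(path)        # traceable section path from root to leaf
print(leaf.pages)  # exact pages to hand to the LLM as context
```

Note how the result is a section path and a page range rather than a similarity score: that is the traceability property the post emphasizes, and it falls out of the navigation itself.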
Key Takeaways
PageIndex eliminates semantic similarity matching in favor of hierarchical document reasoning: instead of embedding documents into vectors and retrieving by cosine similarity, it builds a tree-structured table of contents and lets LLMs reason about where information would logically reside, inspired by AlphaGo's tree search.
Achieves 98.7% accuracy on FinanceBench compared to 45% for Perplexity and 31% for GPT-4o, though this comparison is between a specialized financial system and general-purpose LLMs tested on a domain-specific benchmark.
Addresses a critical limitation of vector-based RAG: semantic similarity and relevance are not the same, particularly for professional documents where answers may be scattered across sections or in appendices with no semantic overlap with the user's question.
PageIndex is not a universal RAG replacement but excels specifically for long, structured documents like financial reports, legal filings, regulatory documents, and technical manuals where hierarchical navigation mirrors how domain experts search.
Provides traceable, explainable retrieval with exact page and section references instead of opaque vector similarity scores, enabling users to verify and audit why specific passages were retrieved.
Scalability trade-offs: PageIndex works well for individual long documents but faces challenges with massive multi-document corpora due to latency and token cost considerations, unlike vector databases optimized for scale.
Eliminates operational complexity: no vector database infrastructure to build or maintain, no arbitrary chunking that destroys cross-section context, and no embedding generation costs.
Open-source implementation available with multiple deployment options including self-hosted, cloud API, and MCP integration.
The broader insight is not that vector RAG is universally wrong, but that it's unsuitable for certain professional use cases where document structure and domain reasoning matter more than semantic similarity patterns.
Part of an emerging trend questioning whether vector databases are necessary for all RAG applications, with alternative architectures gaining traction as specialized tools rather than universal replacements.
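The scalability caveat in the takeaways above is easy to quantify with a back-of-envelope model. The branching factor and the balanced-tree assumption below are illustrative guesses, not measured PageIndex numbers:

```python
import math

def llm_calls_per_query(num_sections: int, branching_factor: int = 8) -> int:
    """Depth of a balanced table-of-contents tree with the given
    branching factor: each navigation step down the tree costs one LLM
    call, so per-query cost grows roughly logarithmically with the
    number of sections in a single document."""
    if num_sections <= 1:
        return 1
    return math.ceil(math.log(num_sections, branching_factor))

# One 500-section filing: a handful of reasoning calls per query.
single_doc_calls = llm_calls_per_query(500)

# A 10,000-document corpus with no corpus-level index multiplies that
# per-document cost, which is where vector databases keep their edge.
corpus_calls = 10_000 * single_doc_calls
print(single_doc_calls, corpus_calls)
```

The logarithmic per-document cost is why the approach suits long individual documents, while the multiplicative corpus cost motivates the post's concession that vector databases remain better optimized for massive multi-document search.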
About
Author: Tech with Mak (@techNmak)
Publication: X (Twitter)
Published: 2026-02
Sentiment / Tone
Enthusiastically convinced but technically precise. The post presents a bold claim with specific evidence and benchmarks, yet avoids hyperbolic absolutism. The rhetorical tone moves from observation ("someone removed...") to teaching (explaining how traditional RAG works) to solution (PageIndex as reasoning-based alternative). The author positions this not as "vectors are bad" but as "similarity is not sufficient for professional documents"—a more nuanced take. The writing is accessible yet technically rigorous, using concrete examples (debt trends in Q3) to illustrate abstract problems. There's underlying confidence that this represents a paradigm shift, but the framing acknowledges real trade-offs rather than claiming universal superiority.
Related Links
PageIndex GitHub Repository. The official open-source implementation of PageIndex, with documentation, examples, and architecture details. Essential for understanding the technical implementation behind the claims in the post.
Vectorless RAG: How PageIndex Works (2026 Guide). A comprehensive guide comparing PageIndex to traditional vector RAG, with use-case recommendations. Clarifies when PageIndex excels and when vector RAG remains superior.
Why You Shouldn't Use Vector Databases for RAG. A complementary critical perspective on vector database limitations in RAG, presenting hybrid search as an alternative. Shows the broader industry discussion around RAG architecture choices.
PageIndex Discussion on r/LLMDevs. Real-world practitioner reactions and questions about PageIndex implementation, including practical trade-offs and limitations discussed by engineers evaluating the tool.
Research Notes
Tech with Mak is a technology educator focused on databases, ML systems, and production engineering. The post touches on a broader 2025-2026 industry discussion questioning whether vector databases are always necessary for RAG. Multiple independent sources (Writers.com, DigitalOcean, Meilisearch) published critical analyses of vector database RAG during this period, suggesting a genuine shift rather than a contrarian take.

PageIndex is genuinely open-source, built by VectifyAI, with 1,374+ GitHub stars. The 98.7% FinanceBench result is real but requires contextualization: it compares a specialized financial RAG system (Mafin 2.5) against general-purpose LLMs on a specialized benchmark, not against other optimized RAG systems.

Critical reactions indicate PageIndex works best as a specialized tool, not a universal replacement. Acknowledged limitations include latency concerns for large deployments, challenges with multi-document queries, and token cost trade-offs. The core innovation, using LLM reasoning over document structure instead of similarity, is genuine and represents a meaningful architectural choice, but it does not suit every use case, particularly broad searches across unstructured content or very large document collections. The post's framing as "better results" is somewhat marketing-focused; more precisely, PageIndex achieves better results for a specific class of problems while potentially being worse for others.