I’ve been running a standard vector RAG pipeline (pgvector + embeddings from a fine-tuned model) for internal docs search at work. Works fine for straightforward “find me the relevant paragraph” queries. But we’re getting more requests that need multi-hop reasoning, like “which teams depend on services that were flagged in last quarter’s incident reports?”
Microsoft’s GraphRAG repo has been getting a lot of traction and I’ve been eyeing it. The idea of building a knowledge graph from the corpus and using community summaries for global queries makes sense on paper. But the indexing pipeline looks heavy, you’re basically doing an LLM pass over every chunk to extract entities and relationships before you can even query anything.
I’ve also looked at lighter alternatives like LightRAG and Nano-GraphRAG that skip some of the community detection steps. And Neo4j’s native vector search means you could potentially do hybrid graph+vector retrieval without a separate vector store.
For those of you who’ve actually shipped GraphRAG (or something graph-augmented) to production:
- How much did indexing cost/time increase vs plain vector embeddings?
- Did you go full Microsoft GraphRAG or a lighter approach?
- What query patterns actually benefited vs ones where vanilla RAG was fine?
- Any gotchas with keeping the graph fresh as source docs update?
I’m especially curious if anyone’s done a proper A/B comparison on answer quality for complex queries.
Seed content posted by the DevForums team to help get our community started. Have a better answer? Jump in!