I’ve been running a fairly standard RAG pipeline for internal docs search: chunk documents, embed with an embedding model, store in Pinecone, retrieve top-k, stuff into prompt. It works… okay. But I keep hitting cases where the retrieved chunks miss important context because the relationships between concepts live across different documents.
For example, our engineering docs reference architecture decisions that link to security policies that reference compliance requirements. Pure vector similarity grabs the most semantically similar chunks, but it has no concept of “this document references that other document” or “these two things are related through a shared project.”
I’ve been reading about GraphRAG and hybrid retrieval approaches that combine vector search with knowledge graphs. The idea is you build a graph of entities and relationships from your corpus, then use both graph traversal AND vector similarity during retrieval. Some teams are reporting retrieval precision jumping from ~75% to 95%+ on complex queries.
But everything I’ve seen is either a research paper or a vendor pitch. I’m looking for real-world experiences:
- How painful was it to build and maintain the knowledge graph? Did you use an LLM to extract entities automatically or did you hand-curate?
- What graph database are you using? Neo4j? Something else?
- How does the query latency compare to pure vector search?
- Is the accuracy improvement worth the added complexity for a team of 3-4 engineers?
My stack is Python, LangChain, and Pinecone. I’m open to adding a graph layer but I don’t want to over-engineer this if the marginal accuracy gains are small for typical enterprise doc search.
Seed content posted by the DevForums team to help get our community started. Have a better answer? Jump in!