Minghao Guo1, Qingcheng Zeng2, Xujiang Zhao3, Yanchi Liu3, Wenchao Yu3, Mengnan Du4, Haifeng Chen3, Wei Cheng3*
1Rutgers University 2Northwestern University 3NEC Laboratories America 4NJIT
*Corresponding Author: weicheng@nec-labs.com
“Through countless siftings, though laborious, only when the wild sands are blown away, does gold appear.”
千淘万漉虽辛苦,吹尽狂沙始到金
Minghao Guo1, Qingcheng Zeng2, Xujiang Zhao3, Yanchi Liu3, Wenchao Yu3, Mengnan Du4, Haifeng Chen3, Wei Cheng3*
1Rutgers University 2Northwestern University 3NEC Laboratories America 4NJIT
*Corresponding Author: weicheng@nec-labs.com
Arkiv
and released DeepSieve preprint on arXiv.DeepSieve is a modular Retrieval-Augmented Generation (RAG) framework that enhances LLM-based reasoning across heterogeneous knowledge sources. It introduces a multi-stage information sieving pipeline: Decompose → Route → Reflexion → Fusion. Each subquery is dynamically routed to the most suitable (Tool, Corpus) pair. If retrieval fails, DeepSieve reroutes or replans to ensure robust answers.
DeepSieve executes a multi-hop reasoning DAG where each subquery selects its optimal knowledge route. Reflexion enables recovery from retrieval failure. Fusion aggregates validated answers into a final response.
DeepSieve achieves state-of-the-art performance on MuSiQue, 2Wiki, and HotpotQA under both DeepSeek-V3 and GPT-4o backbones.
Query: What country is the birthplace of Erik Hort a part of?
Pure RAG: Incorrect or hallucinated answer.
DeepSieve Reasoning Chain:
DeepSieve introduces a flexible backbone for modular RAG across heterogeneous knowledge sources. Looking forward, we envision several exciting directions:
profile
descriptions for dynamic tool selection.We believe these extensions will push RAG systems toward personalized, domain-aware, and self-reflective intelligence.
For general inquiries about this work, feel free to contact any of the authors listed above. For technical issues or implementation questions, please open an issue on our GitHub repository.
You're also welcome to join the discussion on LinkedIn:
https://www.linkedin.com/posts/minghao-guo-181b4b168_rag-llm-multihopqa...