E²GraphRAG: Streamlining Graph-based RAG for High Efficiency and Effectiveness

E²GraphRAG is a lightweight and modular framework designed to enhance both efficiency and effectiveness in Graph-based Retrieval-Augmented Generation (RAG). It streamlines the pipeline from document parsing to answer generation via structured graph reasoning.

📁 Project Structure

.
├── README.md
├── requirements.txt
├── main.py
├── build_tree.py
├── dataloader.py
├── extract_graph.py
├── GlobalConfig.py
├── process_utils.py
├── prompt_dict.py
├── query.py
└── utils.py

📦 Datasets

We use data from:

📚 NovelQA Partly open-source, to obtain the full dataset, please access via a request to the original authors.
🔁 InfiniteBench Fully open-source and publicly available.

You can find how to obtain the data in the ./data/README.md.

Note: After obtaining the datasets, specify the data path when initializing the Dataloader class.

🚀 Getting Started

1. Install Dependencies

Ensure your environment is set up by installing the required packages:

pip install -r requirements.txt

2. Run the Pipeline

The entire pipeline—tree construction, graph extraction, and answer generation—is executed via main.py.

Step-by-step:

Create a config file

Prepare a YAML configuration file to define key parameters.

👉 Example: ./configs/example_config.yaml

Run the pipeline

bash
python main.py --config <path_to_config_file>

📬 Contact & Citation

If you use this code or find it helpful in your research, please consider citing our work. For questions or dataset access (NovelQA), please contact the original authors.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

E²GraphRAG: Streamlining Graph-based RAG for High Efficiency and Effectiveness

📁 Project Structure

📦 Datasets

🚀 Getting Started

1. Install Dependencies

2. Run the Pipeline

📬 Contact & Citation

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 220 Commits
configs		configs
data		data
.gitignore		.gitignore
README.md		README.md
build_tree.py		build_tree.py
dataloader.py		dataloader.py
extract_graph.py		extract_graph.py
main.py		main.py
main_cacheready.py		main_cacheready.py
process_utils.py		process_utils.py
prompt_dict.py		prompt_dict.py
query.py		query.py
requirements.txt		requirements.txt
utils.py		utils.py

Folders and files

Latest commit

History

Repository files navigation

E²GraphRAG: Streamlining Graph-based RAG for High Efficiency and Effectiveness

📁 Project Structure

📦 Datasets

🚀 Getting Started

1. Install Dependencies

2. Run the Pipeline

📬 Contact & Citation

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages