ScholarAgent: An Advanced Multi-Agent Research Assistant

ScholarAgent is a sophisticated multi-agent system designed to perform deep, relational reasoning over a corpus of scientific papers. It moves beyond standard RAG by building and querying a dynamic Knowledge Graph, allowing it to answer complex questions that require synthesizing information across multiple documents and their relationships.

Demo

(A demonstration of the agent answering a complex query using the Knowledge Graph tool.)

Key Features

Automated Knowledge Graph Construction: A Makefile-driven pipeline that automatically fetches papers from arXiv, enriches them with data from Semantic Scholar, extracts key concepts, and builds a comprehensive Neo4j Knowledge Graph.
Intelligent Multi-Agent System: Built with LangGraph, the system uses a Manager agent to intelligently route complex queries to specialized tools.
Hybrid Toolset for Deep Reasoning:
- Advanced RAG Tool: For content-based questions, using a retrieve-then-rerank pipeline for high-quality context.
- Knowledge Graph Tool: For relational questions, using a powerful gemini-1.5-pro model to translate natural language into precise Cypher database queries.
Tiered LLM Strategy: Utilizes the efficient gemini-1.5-flash for general tasks and the powerful gemini-1.5-pro for high-stakes reasoning, balancing performance and cost.
Fully Tested and Type-Hinted: A robust test suite built with pytest and a modern, type-hinted codebase enforced by pre-commit hooks.

Architecture Overview

The system is split into two core components: an offline Data Pipeline that builds the knowledge base, and an online Agentic System that uses it to answer questions.

graph TD
    subgraph "Offline: Data Pipeline"
        direction LR
        A[External Sources <br> arXiv & Semantic Scholar] --> B{Data Processing Scripts};
        B --> C[Neo4j Knowledge Graph];
        B --> D[ChromaDB Vector Store];
    end

    subgraph "Online: Agentic System"
        direction LR
        E[User Query] --> F{Manager Agent};
        F -- routes to --> G[Knowledge Graph Tool];
        F -- routes to --> H[Advanced RAG Tool];
        G -- queries --> C;
        H -- queries --> D;
        I[Generator Agent]
        G --> I;
        H --> I;
        I --> J[Final Answer];
    end

    classDef source fill:#FFD580,stroke:#666,stroke-width:1.5px,color:#222;
    classDef process fill:#A8E6A3,stroke:#666,stroke-width:1.5px,color:#222;
    classDef storage fill:#9EC9FF,stroke:#666,stroke-width:1.5px,color:#222;
    classDef output fill:#D7B3FF,stroke:#666,stroke-width:1.5px,color:#222;

    class A,E source;
    class B,F,G,H,I process;
    class C,D storage;
    class J output;

Tech Stack

Python

Google Gemini

LangChain

Neo4j

ChromaDB

Pytest

Getting Started

These instructions will get you a copy of the project up and running on your local machine.

Prerequisites:

Python 3.12+
Poetry (for dependency management)
A running Neo4j instance (e.g., via Docker)
A Google AI API Key

Installation:

Clone the repository:

git clone [[https://github.com/](https://github.com/)shilpamusale/scholar-agent.git](https://github.com/shilpamusale/scholar-agent.git)
cd scholar-agent

Create a .env file: Copy the example environment file and add your credentials.

cp .env.example .env
# Now, edit the .env file with your API keys and database URI

Install dependencies:
```
poetry install
```
Build the Knowledge Graph: This command will run the entire data pipeline. This may take some time.
```
make all
```

Usage

Once the knowledge graph is built, you can ask the agent questions from the command line.

Example 1: Relational Query (Knowledge Graph Tool)

python main.py "Who are the most cited authors on the topic of 'sparse autoencoders'?"

Example 2: Content Query (RAG Tool)

python main.py "Summarize the abstract of the paper The Interpretable Dictionary in Sparse Coding'"

Example 3: Complex Hybrid Query

python main.py "How are researchers at Anthropic using dictionary learning for interpretability, particularly in relation to sparse autoencoders?"

Future Work: The Autonomous Research Agent

The current version of ScholarAgent is a robust information retrieval and reasoning system. The next major phase of this project is to evolve it into a fully autonomous research and experimentation agent.

Vision: An AI agent that not only reads new research papers but can also design, run, and analyze experiments to test or extend the ideas presented.

Core Components

Research Planning & Hypothesis Generation: An advanced reasoning layer where the agent can map new papers against existing work, identify open problems or weaknesses, and propose novel, testable experiments.
Automated Experiment Execution: The ability for the agent to autonomously:
- Spin up sandboxed code environments (e.g., via local containers or cloud APIs like Modal).
- Write and execute code to implement baseline models and run comparative benchmarks.
Self-Critique & Iterative Refinement: An analysis module for the agent to interpret its own experimental results using statistical methods, compare them to the paper's original claims, and iteratively refine its hypotheses based on the evidence.

Advanced Capabilities

Multi-Agent Collaboration: Evolving the system into a team of specialized agents: a Literature Reviewer, an Experimenter to run the code, and a Critic to analyze the results and suggest new directions.
Active Tool Learning: Empowering the agent to dynamically identify and learn how to use external tools and libraries (e.g., specific packages from Hugging Face, PyTorch, or statsmodels) as needed to complete its experiments.

Technical Approach

Orchestration & Configuration: The agent's complex workflows will be managed using a robust framework like Hydra for configuration management.
Environments: Experimental environments will be containerized using Docker to ensure reproducibility.
Interpretability & Safety: Mechanistic interpretability will be explored using tools like TransformerLens, while bias mitigation will be implemented with Parameter-Efficient Fine-Tuning techniques like LoRA.

This next phase aims to bridge the gap between understanding research and actively contributing to it.

License

This project is licensed under the Apache 2.0 License. See the LICENSE file for details.

Name		Name	Last commit message	Last commit date
Latest commit History 70 Commits
.github/workflows		.github/workflows
assets		assets
configs		configs
data/raw		data/raw
notebooks		notebooks
scripts		scripts
src		src
tests		tests
.gitignore		.gitignore
.pre-commit-config.yaml		.pre-commit-config.yaml
LICENSE		LICENSE
Makefile		Makefile
README.md		README.md
conftest.py		conftest.py
inspect_db.py		inspect_db.py
main.py		main.py
pyproject.toml		pyproject.toml
requirements.txt		requirements.txt
test_retriever.py		test_retriever.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

ScholarAgent: An Advanced Multi-Agent Research Assistant

Demo

Key Features

Architecture Overview

Tech Stack

Getting Started

Usage

Future Work: The Autonomous Research Agent

Core Components

Advanced Capabilities

Technical Approach

License

About

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

ScholarAgent: An Advanced Multi-Agent Research Assistant

Demo

Key Features

Architecture Overview

Tech Stack

Getting Started

Usage

Future Work: The Autonomous Research Agent

Core Components

Advanced Capabilities

Technical Approach

License

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Contributors

Uh oh!

Languages