CODETRAVERSE

CODETRAVERSE is a cross-language static code analysis tool that extracts structural components and dependencies from code repositories in Haskell, Rescript, Typescript, Rust, golang, python.

Given any supported codebase, CODETRAVERSE outputs:

Per-file JSON summaries (fdep/)
Full call and type dependency graphs in GraphML and gpickle formats

This enables rich code visualization, impact analysis, dependency inspection, and more.

🚀 Features

Multi-language Support: Haskell, Rescript, Typescript, Rust, golang, python (easily extensible!)
Scalable: Handles arbitrarily nested folder structures
Rich Outputs:
- JSON component files for every source file
- repo_function_calls.graphml (visual graph for tools like Gephi, yEd, etc.)
- repo_function_calls.gpickle (Python-native NetworkX graph)
Graph Querying: Use path.py to:
- Find the shortest path between two components in the graph
- List all direct neighbors (incoming and outgoing edges) for a given component

🛠️ Installation

1. Clone the Repository

git clone https://github.com/juspay/codetraverse.git
cd codetraverse

2. Install Dependencies

We recommend using a virtual environment:

python3 -m venv venv
source venv/bin/activate

# Install all required Python packages
pip install -r requirements.txt

⚡ Usage

Basic Usage

python main.py --ROOT_DIR=/path/to/your/repo --LANGUAGE=golang

Replace /path/to/your/repo with the root directory of your code repository
Set --LANGUAGE to one of: haskell, rescript, typescript, rust, golang, python

Outputs will be saved in:

fdep/ (JSON component summaries)
graph/repo_function_calls.graphml
graph/repo_function_calls.gpickle

All Options

Argument	Type	Description
`--ROOT_DIR`	str	Path to the root of the source code repository (required)
`--LANGUAGE`	str	One of `haskell`, `rescript`, `typescript`, `rust`, `golang`, `python` (required)
`--OUTPUT_BASE` (opt)	str	Output directory for per-file JSONs (default: `fdep`)
`--GRAPH_DIR` (opt)	str	Output directory for the graphs (default: `graph`)

Example

python main.py --ROOT_DIR =/my-go-project --LANGUAGE=golang

⚡ Graph Path Usage

Querying the Graph with `path.py`

Use path.py to query the generated graph for relationships between components.

Find the Shortest Path Between Two Components

python path.py --GRAPH_PATH=/path/to/repo_function_calls.graphml --COMPONENT=<target_component> --SOURCE=<source_component>

Replace /path/to/repo_function_calls.graphml with the path to your graph file.
Replace <target_component> with the fully-qualified ID of the target component (e.g., PgIntegrationApp::make).
Replace <source_component> with the fully-qualified ID of the source component.

Example:

python path.py --GRAPH_PATH=graph/repo_function_calls.graphml --COMPONENT=PgIntegrationApp::make --SOURCE=PgIntegrationApp::init

List Direct Neighbors of a Component

python path.py --GRAPH_PATH=/path/to/repo_function_calls.graphml --COMPONENT=<target_component>

Replace /path/to/repo_function_calls.graphml with the path to your graph file.
Replace <target_component> with the fully-qualified ID of the target component.

Example:

python path.py --GRAPH_PATH=graph/repo_function_calls.graphml --COMPONENT=PgIntegrationApp::make

All Options

Argument	Type	Description
`--GRAPH_PATH` or `-g`	str	Path to the saved graph (`.gpickle` or `.graphml`)
`--COMPONENT` or `-c`	str	Target fully-qualified component ID (e.g., `PgIntegrationApp::make`)
`--SOURCE` or `-s`	str	(Optional) Source fully-qualified ID. If omitted, lists direct neighbors.

Output Examples

Shortest Path

If a path exists between the source and target components, the output will look like:

Shortest path from 'PgIntegrationApp::init' → 'PgIntegrationApp::make':
  PgIntegrationApp::init → PgIntegrationApp::process → PgIntegrationApp::make

Direct Neighbors

If listing direct neighbors, the output will show incoming and outgoing edges:

Nodes with edges INTO 'PgIntegrationApp::make' (2):
  PgIntegrationApp::process --[calls]--> PgIntegrationApp::make
  PgIntegrationApp::init --[calls]--> PgIntegrationApp::make

Nodes with edges OUT OF 'PgIntegrationApp::make' (1):
  PgIntegrationApp::make --[uses_type]--> PgIntegrationApp::ResultType

🔍 How it Works

Extract: Recursively walks your codebase, parsing each file, extracting functions, types, variables, dependencies, etc.
Component Output: For every file, generates a detailed .json file in the fdep/ directory.
Unification and Graph Construction: Merges all components into a single global call/type graph using NetworkX.
Export: Writes the graph to:
- repo_function_calls.graphml (portable to any graph tool)
- repo_function_calls.gpickle (for Python/NetworkX use)

🗂️ Output Files

`fdep/`

Per-source-file JSONs, containing extracted component data.

`graph/repo_function_calls.graphml`

Universal call/type dependency graph in GraphML (XML) format.

`graph/repo_function_calls.gpickle`

NetworkX-serialized Python object for fast downstream analysis.

❓ FAQ

Q: Does this overwrite my source code?
A: No. All outputs are written to fdep/ and graph/.

Q: Can I use this for closed-source or private code?
A: Yes! All code runs locally.

Q: What if I want to re-run and overwrite outputs?
A: Just rm -rf fdep graph before running again.

Q: What if I want to visualize the graph?
A: Use Gephi, yEd, or any tool that supports GraphML.

Name		Name	Last commit message	Last commit date
Latest commit History 149 Commits
.github/workflows		.github/workflows
bridge		bridge
codetraverse.egg-info		codetraverse.egg-info
codetraverse		codetraverse
sample_code_repo_test		sample_code_repo_test
scripts		scripts
tests		tests
.gitignore		.gitignore
.python-version		.python-version
MANIFEST.in		MANIFEST.in
README.md		README.md
main.py		main.py
package-lock.json		package-lock.json
package.json		package.json
pyproject.toml		pyproject.toml
setup.py		setup.py
testing.py		testing.py
uv.lock		uv.lock

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

CODETRAVERSE

🚀 Features

🛠️ Installation

1. Clone the Repository

2. Install Dependencies

⚡ Usage

Basic Usage

All Options

Example

⚡ Graph Path Usage

Querying the Graph with `path.py`

Find the Shortest Path Between Two Components

List Direct Neighbors of a Component

All Options

Output Examples

Shortest Path

Direct Neighbors

🔍 How it Works

🗂️ Output Files

`fdep/`

`graph/repo_function_calls.graphml`

`graph/repo_function_calls.gpickle`

❓ FAQ

About

Uh oh!

Releases

Packages

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

CODETRAVERSE

🚀 Features

🛠️ Installation

1. Clone the Repository

2. Install Dependencies

⚡ Usage

Basic Usage

All Options

Example

⚡ Graph Path Usage

Querying the Graph with path.py

Find the Shortest Path Between Two Components

List Direct Neighbors of a Component

All Options

Output Examples

Shortest Path

Direct Neighbors

🔍 How it Works

🗂️ Output Files

fdep/

graph/repo_function_calls.graphml

graph/repo_function_calls.gpickle

❓ FAQ

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Querying the Graph with `path.py`

`fdep/`

`graph/repo_function_calls.graphml`

`graph/repo_function_calls.gpickle`

Packages