Deal-intelligence system for tracking lead investment signals from target VC funds, extracting structured funding data, and serving that data to a frontend dashboard.
- Backend ingestion + extraction pipeline for funding/deal signals.
- Structured persistence with migration support.
- Frontend dashboard for review and drill-down.
- Test suite around extraction/storage flows.
Raw funding content is inconsistent across sources. This project standardizes extraction and tracks lead-signal quality so downstream analysis is reliable.
- Ingestion/scraping scripts for source collection.
- Extraction layer for structured deal parsing.
- Storage layer (`src/archivist`) for normalized persistence.
- API + frontend (`frontend`) for visibility.
- Structured extraction with strict schemas: Reduces bad data drift; tradeoff is extra handling for ambiguous articles.
- Source-specific heuristics + shared normalization: Improves accuracy on noisy inputs; tradeoff is ongoing heuristic maintenance.
- Clear separation of backend/frontend workspaces: Keeps deployment and debugging cleaner; tradeoff is more coordination across stacks.
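To illustrate the strict-schema tradeoff (field names and the round list are hypothetical, not the project's actual schema), a record type can refuse ambiguous values outright instead of persisting them:

```python
from dataclasses import dataclass

# Hypothetical whitelist; the real schema may differ.
VALID_ROUNDS = {"pre-seed", "seed", "series-a", "series-b", "series-c"}

@dataclass(frozen=True)
class DealRecord:
    company: str
    round_type: str
    amount_usd: int  # normalized to whole dollars

    def __post_init__(self):
        # Strictness: fail fast at construction time rather than
        # letting drifted data reach storage.
        if not self.company.strip():
            raise ValueError("company is required")
        if self.round_type not in VALID_ROUNDS:
            raise ValueError(f"unknown round type: {self.round_type!r}")
        if self.amount_usd <= 0:
            raise ValueError("amount must be positive")

DealRecord("Acme", "seed", 2_000_000)   # ok
# DealRecord("Acme", "Series A?", 0)    # raises ValueError
```

This is where the "extra handling for ambiguous articles" cost shows up: anything the validator rejects needs an explicit review path upstream.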
Prerequisites: Python 3.11 or newer.
```
python3 -m venv .venv
source .venv/bin/activate
pip install -r requirements.txt
```

Prerequisites: Node.js 18+ and npm dependencies installed.
```
cd frontend
npm install
npm run dev
```

Prerequisites: Python 3.11+ and project dependencies installed.
```
python3 -m pytest tests -q
```

- Empty dashboard: verify backend data ingest path and API base config.
- Extraction quality drops: review source-specific parsing and confidence thresholds.
- Migration issues: check the `alembic` revision state before rerunning ingestion.
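When extraction quality drops, one simple lever is the confidence cutoff. A minimal sketch, assuming records carry a `confidence` score (the field name and 0.8 default are illustrative, not the project's actual values):

```python
def filter_by_confidence(records, threshold=0.8):
    """Drop low-confidence extractions before they reach the dashboard.

    The threshold is worth tuning per source: noisy sources may need a
    higher bar, clean sources a lower one.
    """
    return [r for r in records if r.get("confidence", 0.0) >= threshold]

rows = [
    {"company": "Acme", "confidence": 0.93},
    {"company": "Beta", "confidence": 0.41},
]
filter_by_confidence(rows)  # keeps only the Acme row
```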
- How I chose schema strictness vs extraction flexibility.
- How I debugged lead-signal misclassification.
- Why I split pipeline modules instead of one extraction script.
- DECISIONS.md
- BUILD_LOG.md
- KNOWN_LIMITATIONS.md
- DEMO.md
- SECURITY.md