Invoice & Receipt Processing Platform (OpenEnv + AI Agents)

title: NamastAI
emoji: 💻
colorFrom: blue
colorTo: red
sdk: docker
pinned: false
short_description: invoice-env

OpenEnv-compliant environment and full-stack app for invoice automation.

Quickstart For New Contributors

Follow this section if you are cloning the repo for the first time.

1. Prerequisites

Python 3.10+
Node.js 18+ and npm
Git
Docker Desktop (for container checks)

2. Clone

git clone https://github.com/IshwinderKaur8/invoice-env.git
cd invoice-env

3. Create Python Environment

Windows PowerShell:

python -m venv .venv
.\.venv\Scripts\Activate.ps1
python -m pip install --upgrade pip
pip install -r requirements.txt
pip install -r backend/requirements.txt

macOS/Linux:

python3 -m venv .venv
source .venv/bin/activate
python -m pip install --upgrade pip
pip install -r requirements.txt
pip install -r backend/requirements.txt

4. Configure Environment Variables

Copy example files and edit values as needed:

cp .env.example .env
cp backend/.env.example backend/.env
cp frontend/.env.example frontend/.env

Windows PowerShell equivalent:

Copy-Item .env.example .env
Copy-Item backend/.env.example backend/.env
Copy-Item frontend/.env.example frontend/.env

Minimum required for hackathon inference:

API_BASE_URL
MODEL_NAME
HF_TOKEN

5. Run Core Checks

python -m pytest -q
python scripts/run_baseline.py

6. Run Backend + Frontend Locally

Backend (terminal 1):

uvicorn backend.main:app --reload --host 0.0.0.0 --port 8000

Frontend (terminal 2):

cd frontend
npm install
npm run dev

7. Run Submission Inference

python inference.py

8. Run Pre-Submission Validator

Install validator CLI once:

pip install openenv-core

Run checks:

openenv validate
bash validate-submission.sh <your-space-url>

Core goal: train/evaluate agents to process invoices and receipts by solving three tasks in sequence:

Field extraction (easy)
Expense categorization (medium)
Anomaly detection (hard)

Problem Coverage

Task 1: Field Extraction (Easy)

Observation: raw invoice fields and text context
Action: extract vendor_name and invoice_date
Reward: exact match 0.99; partial credit for fuzzy similarity >= 0.8; minimum 0.01
Grader: deterministic comparison against ground truth
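
The reward rule above can be sketched as follows. This is a minimal illustration, not the grader in env/: the function name field_reward, the use of difflib for similarity, the whitespace and case normalization, and the way partial credit scales with similarity are all assumptions. Only the 0.99, 0.8, and 0.01 constants come from the task description.

```python
from difflib import SequenceMatcher

EXACT_REWARD = 0.99     # from the task description
FUZZY_THRESHOLD = 0.8   # from the task description
MIN_REWARD = 0.01       # from the task description


def field_reward(predicted: str, expected: str) -> float:
    """Score one extracted field against its ground-truth value."""
    # Assumption: comparison ignores surrounding whitespace and letter case.
    pred = predicted.strip().lower()
    truth = expected.strip().lower()
    if pred == truth:
        return EXACT_REWARD
    similarity = SequenceMatcher(None, pred, truth).ratio()
    if similarity >= FUZZY_THRESHOLD:
        # Assumption: partial credit scales with similarity and never
        # exceeds the exact-match reward.
        partial = round(similarity * EXACT_REWARD, 2)
        return max(MIN_REWARD, min(EXACT_REWARD, partial))
    return MIN_REWARD
```

Because the function is pure and uses no randomness, repeated calls with the same inputs return the same score, which is the property the deterministic grader relies on.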

Task 2: Expense Categorization (Medium)

Observation: vendor, description, line-item metadata
Action: assign category from Travel, Office Supplies, Utilities, Misc
Reward: exact match 0.99; 0.5 if correct label appears in top-2 prediction; minimum 0.01
Grader: deterministic category check against labeled data
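
A small sketch of the categorization reward, assuming the agent supplies a ranked list of predictions. The function name category_reward and the ranked-list input are hypothetical; the source does not say how a top-2 prediction is passed to the grader.

```python
from typing import Sequence

CATEGORIES = ("Travel", "Office Supplies", "Utilities", "Misc")


def category_reward(ranked_predictions: Sequence[str], expected: str) -> float:
    """Return 0.99 for a top-1 hit, 0.5 for a top-2 hit, else 0.01."""
    if expected not in CATEGORIES:
        raise ValueError(f"Unknown ground-truth category: {expected!r}")
    top_two = list(ranked_predictions[:2])
    if top_two and top_two[0] == expected:
        return 0.99
    if expected in top_two:
        return 0.5
    return 0.01
```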

Task 3: Anomaly Detection (Hard)

Observation: invoice batch context (amount patterns, references, vendor/date behavior)
Action: set anomaly flag for duplicate/high-risk invoices
Reward: continuous F1 score computed from precision and recall, clamped to the range 0.01..0.99
Grader: deterministic scoring function derived from confusion counts
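
One way to derive such a score from confusion counts is shown below. It is a sketch under assumptions: the name anomaly_reward is hypothetical, and the handling of empty denominators (treated as 0.0 before clamping) may differ from the actual grader.

```python
def anomaly_reward(true_pos: int, false_pos: int, false_neg: int) -> float:
    """Compute F1 from confusion counts and clamp it to 0.01..0.99."""
    flagged = true_pos + false_pos
    actual = true_pos + false_neg
    # Assumption: an empty denominator yields 0.0 instead of raising.
    precision = true_pos / flagged if flagged else 0.0
    recall = true_pos / actual if actual else 0.0
    if precision + recall == 0.0:
        f1 = 0.0
    else:
        f1 = 2 * precision * recall / (precision + recall)
    return min(0.99, max(0.01, f1))
```

For example, 2 true positives with 1 false positive and 1 false negative give precision and recall of 2/3 each, so the score is about 0.667.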

OpenEnv Models

Implemented with Pydantic typed models:

from typing import Any, Dict, Optional

from pydantic import BaseModel

class InvoiceObservation(BaseModel):
    vendor_name: str
    invoice_date: str
    amount: float
    description: str
    metadata: Dict[str, Any]

class InvoiceAction(BaseModel):
    extracted_fields: Dict[str, str]
    category: Optional[str] = None
    anomaly_flag: Optional[bool] = None

class InvoiceReward(BaseModel):
    score: float
    details: Dict[str, Any]

Episode Definition

One episode equals processing one invoice batch. Default behavior uses deterministic synthetic invoices with anomalies included.

Environment API

The environment follows OpenEnv-style methods:

reset()
step(action)
state()

Implementation entrypoint and schema wiring are defined in openenv.yaml.
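
The interaction pattern can be illustrated with a toy stand-in. ToyInvoiceEnv and run_episode are hypothetical and are not the classes in env/environment.py; in particular, the (observation, reward, done) return shape of step() is an assumption, and the toy only scores the category field.

```python
from typing import Any, Callable, Dict, List, Optional, Tuple

Observation = Dict[str, Any]
Action = Dict[str, Any]


class ToyInvoiceEnv:
    """Hypothetical stand-in exposing reset(), step(action), and state()."""

    def __init__(self, labeled_invoices: List[Tuple[Observation, str]]) -> None:
        self._items = labeled_invoices
        self._index = 0
        self._total = 0.0

    def reset(self) -> Observation:
        """Start a new episode and return the first observation."""
        self._index = 0
        self._total = 0.0
        return self._items[0][0]

    def step(self, action: Action) -> Tuple[Optional[Observation], float, bool]:
        """Score the action for the current invoice and advance by one."""
        _, expected = self._items[self._index]
        reward = 0.99 if action.get("category") == expected else 0.01
        self._total += reward
        self._index += 1
        done = self._index >= len(self._items)
        next_obs = None if done else self._items[self._index][0]
        return next_obs, reward, done

    def state(self) -> Dict[str, Any]:
        """Report progress without advancing the episode."""
        return {"step": self._index, "total_score": self._total}


def run_episode(env: ToyInvoiceEnv, policy: Callable[[Observation], Action]) -> float:
    """Drive one full episode: reset once, then step until done."""
    obs = env.reset()
    done = False
    while not done:
        obs, _reward, done = env.step(policy(obs))
    return env.state()["total_score"]
```

A policy here is any callable that maps an observation to an action dictionary, so the same loop works for a heuristic or an LLM-backed agent.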

Repository Structure

.
├── env/                     # Core OpenEnv runtime (models, graders, tasks, dataset)
├── scripts/                 # Baseline agent runner (OpenAI + heuristic fallback)
├── tests/                   # Model, grader, and environment tests
├── backend/                 # FastAPI service + Mongo integration
├── frontend/                # React/Vite/Tailwind dashboard
├── openenv.yaml             # OpenEnv metadata and schema mapping
├── .env.example             # Root inference env template
├── validate-submission.sh   # Root wrapper for organizer validator script
├── Dockerfile               # Containerized baseline execution
└── requirements.txt         # Core environment dependencies

Baseline Agent

Two baseline entrypoints are provided:

inference.py (root): hackathon submission script (mandatory filename)
scripts/run_baseline.py: developer helper script with local heuristic mode

scripts/run_baseline.py supports:

BASELINE_MODE=auto (default): OpenAI if API key is present, otherwise heuristic
BASELINE_MODE=openai: strict OpenAI mode
BASELINE_MODE=heuristic: fully offline deterministic mode
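
The mode selection described above might be resolved like this. The function name resolve_baseline_mode is hypothetical, and two details are assumptions: that the key is read from OPENAI_API_KEY, and that strict OpenAI mode raises when the key is missing.

```python
import os
from typing import Mapping, Optional

VALID_MODES = ("auto", "openai", "heuristic")


def resolve_baseline_mode(environ: Optional[Mapping[str, str]] = None) -> str:
    """Return the effective mode: 'openai' or 'heuristic'."""
    env = os.environ if environ is None else environ
    mode = env.get("BASELINE_MODE", "auto").strip().lower()
    if mode not in VALID_MODES:
        raise ValueError(f"Unsupported BASELINE_MODE: {mode!r}")
    # Assumption: the API key lives in OPENAI_API_KEY.
    has_key = bool(env.get("OPENAI_API_KEY"))
    if mode == "auto":
        return "openai" if has_key else "heuristic"
    if mode == "openai" and not has_key:
        # Assumption: strict mode fails fast instead of falling back.
        raise RuntimeError("BASELINE_MODE=openai requires an API key")
    return mode
```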

Example

python scripts/run_baseline.py

Recent heuristic run output:

Steps: 12
Total score: 8.400
Average score: 0.700

Hackathon Submission Contract

Required Runtime Variables

Set these variables before running inference.py:

API_BASE_URL: API endpoint for the LLM provider
MODEL_NAME: model identifier for inference
HF_TOKEN: API key/token used by the OpenAI client
LOCAL_IMAGE_NAME: local image reference, needed only if the organizer uses an image-backed env constructor

Reference file:

.env.example

Optional reproducibility variables:

BATCH_SIZE (default 24)
SEED (default 42)
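
A sketch of how these variables could be read with the documented defaults. InferenceConfig and load_config are hypothetical names, not taken from inference.py, and failing on a missing required variable is an assumption about the desired behavior.

```python
import os
from dataclasses import dataclass
from typing import Mapping, Optional

REQUIRED_VARS = ("API_BASE_URL", "MODEL_NAME", "HF_TOKEN")


@dataclass(frozen=True)
class InferenceConfig:
    api_base_url: str
    model_name: str
    hf_token: str
    batch_size: int = 24  # documented default
    seed: int = 42        # documented default


def load_config(environ: Optional[Mapping[str, str]] = None) -> InferenceConfig:
    """Read required and optional variables, failing early on gaps."""
    env = os.environ if environ is None else environ
    missing = [name for name in REQUIRED_VARS if not env.get(name)]
    if missing:
        raise RuntimeError("Missing required variables: " + ", ".join(missing))
    return InferenceConfig(
        api_base_url=env["API_BASE_URL"],
        model_name=env["MODEL_NAME"],
        hf_token=env["HF_TOKEN"],
        batch_size=int(env.get("BATCH_SIZE", "24")),
        seed=int(env.get("SEED", "42")),
    )
```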

Mandatory Inference Script

The submission inference script is:

inference.py (at repository root)

It uses the OpenAI Python client for all LLM calls and reads credentials/config from the environment variables above.

Structured Stdout Format

inference.py emits strict tagged logs:

[START] once at run start
[STEP] once per environment step
[END] once at run completion

Example (the rewards list is shortened here; per the schema below, a real run emits one value per step):

[START] task=invoice-processing env=invoice-openenv model=Qwen/Qwen2.5-72B-Instruct
[STEP] step=1 action={"extracted_fields":{"vendor_name":"Amazon","invoice_date":"2026-01-12"},"category":"Office Supplies","anomaly_flag":false} reward=0.70 done=false error=null
[END] success=true steps=24 rewards=0.70,0.70,0.70

Required line schema (field order preserved):

[START] task=<task_name> env=<benchmark> model=<model_name>
[STEP] step=<n> action=<action_str> reward=<0.00> done=<true|false> error=<msg|null>
[END] success=<true|false> steps=<n> rewards=<r1,r2,...,rn>
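
The schema above can be produced with a few small formatters. This is a sketch, not the code in inference.py: the function names are hypothetical, and serializing the action as compact JSON is an assumption inferred from the example log line.

```python
import json
from typing import Any, Dict, Optional, Sequence


def _flag(value: bool) -> str:
    """Render booleans as lowercase true/false, as the schema requires."""
    return "true" if value else "false"


def format_start(task: str, env: str, model: str) -> str:
    return f"[START] task={task} env={env} model={model}"


def format_step(step: int, action: Dict[str, Any], reward: float,
                done: bool, error: Optional[str] = None) -> str:
    # Assumption: the action is emitted as compact JSON with no spaces
    # after separators.
    action_str = json.dumps(action, separators=(",", ":"))
    error_str = "null" if error is None else error
    return (f"[STEP] step={step} action={action_str} "
            f"reward={reward:.2f} done={_flag(done)} error={error_str}")


def format_end(success: bool, steps: int, rewards: Sequence[float]) -> str:
    rewards_str = ",".join(f"{r:.2f}" for r in rewards)
    return f"[END] success={_flag(success)} steps={steps} rewards={rewards_str}"
```

Keeping the formatting in one place makes it harder for a stray print to break the field order that the validator expects.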

Run command:

python inference.py

Local Setup

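Core Environment

The activation command below is for Windows; on macOS/Linux, run source .venv/bin/activate instead.
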
python -m venv .venv
.venv\Scripts\activate
pip install -r requirements.txt
python -m pytest -q
python scripts/run_baseline.py

Backend (FastAPI)

cd backend
pip install -r requirements.txt
uvicorn main:app --reload --host 0.0.0.0 --port 8000

Optional backend environment variables:

OPENAI_API_KEY
OPENAI_MODEL (default gpt-4o-mini)
MONGO_URI
MONGO_DB_NAME
FRONTEND_ORIGIN

Reference file:

backend/.env.example

Frontend (React + Vite)

cd frontend
npm install
npm run dev

Optional frontend environment variable:

VITE_API_BASE_URL (default http://localhost:8000/api)

Reference file:

frontend/.env.example

API Endpoints

POST /api/reset
POST /api/step
GET /api/state
POST /api/run-agent
GET /api/results

These endpoints allow both interactive stepping and full-episode automated agent runs.
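
A minimal standard-library client for these routes might look like the sketch below. The helper names build_request and call are hypothetical, the base URL is the documented frontend default, and treating every response body as JSON is an assumption. The batch_size payload mirrors the curl example in the Docker section.

```python
import json
import urllib.request
from typing import Any, Dict, Optional

BASE_URL = "http://localhost:8000/api"  # documented VITE_API_BASE_URL default


def build_request(method: str, path: str,
                  payload: Optional[Dict[str, Any]] = None) -> urllib.request.Request:
    """Build a request for one of the /api routes without sending it."""
    data = None if payload is None else json.dumps(payload).encode("utf-8")
    headers = {"Content-Type": "application/json"} if data is not None else {}
    return urllib.request.Request(BASE_URL + path, data=data,
                                  headers=headers, method=method)


def call(method: str, path: str,
         payload: Optional[Dict[str, Any]] = None) -> Any:
    """Send the request and decode the body (assumed to be JSON)."""
    request = build_request(method, path, payload)
    with urllib.request.urlopen(request, timeout=30) as response:
        return json.loads(response.read().decode("utf-8"))
```

With the backend running locally, call("POST", "/reset", {"batch_size": 8}) followed by call("GET", "/state") would exercise the interactive path.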

Deployment

Docker

The root Dockerfile is configured for containerized API execution and HF Spaces compatibility.

docker build -t invoice-openenv .
docker run --rm -p 7860:7860 invoice-openenv

Health check examples after startup:

curl http://localhost:7860/
curl -X POST http://localhost:7860/reset -H "Content-Type: application/json" -d '{}'
curl -X POST http://localhost:7860/api/reset -H "Content-Type: application/json" -d '{"batch_size": 8}'

If you need the exact organizer command path from repo root:

bash validate-submission.sh <your-space-url>

Hugging Face Spaces

For Spaces Docker deployment:

Use this repository as the Space source.
Select Docker SDK.
Add Space secrets for API_BASE_URL, MODEL_NAME, and HF_TOKEN.
Add the openenv tag to the Space metadata.
Verify the Space returns 200 on /reset (organizer validator check).

Full-Stack Hosting

Backend: backend/render.yaml (Render)
Frontend: frontend/vercel.json (Vercel)

Validation Status

Deterministic synthetic dataset with duplicates and high-amount anomalies
Deterministic graders for all three tasks
Typed observation/action/reward models
OpenEnv metadata in openenv.yaml
Baseline script with OpenAI + offline fallback
Docker support included
Test suite passing (14 passed)

Pre-Submission Checklist

openenv.yaml includes metadata, task definitions, and typed schema references
env/models.py defines typed Observation/Action/Reward Pydantic models
env/environment.py implements step(), reset(), and state()
3 tasks implemented with deterministic graders; scores stay within 0.0..1.0 (clamped to 0.01..0.99)
inference.py exists at root and uses OpenAI client + required env variables
Structured logs include [START], [STEP], [END]
Docker image builds and serves API container
/reset responds successfully from deployed Space

Organizer Validator Script

The repository includes an organizer-compatible pre-validation helper:

scripts/validate-submission.sh
