FastAPI incident console for reviewing AI failures with timeline evidence, replay previews, and mitigation planning.
Most AI demos stop at generation. Real systems need incident review after things go wrong:
- prompt injection attempts
- hallucinated SQL or tool calls
- policy bypass attempts
- grounding failures
- unsafe replay loops
This project models the review layer that comes after an incident has already happened. It gives teams a place to inspect evidence, understand the runtime timeline, and plan a safe replay:
- incident queue with severity, impact, and service ownership
- timeline events across retrieval, reasoning, policy, and response stages
- evidence records for prompt fragments, tool requests, and invalid SQL drafts
- replay preview modes for safe re-runs
- mitigation tasks with owner and priority
- static dashboard for portfolio and demo screenshots
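The incident and timeline objects above can be modeled roughly as below. This is an illustrative sketch only; the field names and dataclass shapes are assumptions, not the project's actual schema:

```python
from dataclasses import dataclass, field


@dataclass
class TimelineEvent:
    # Stage is one of the runtime phases named above:
    # retrieval, reasoning, policy, or response.
    stage: str
    summary: str


@dataclass
class Incident:
    # Hypothetical fields mirroring the queue columns:
    # severity, impact, and service ownership.
    incident_id: str
    title: str
    severity: str
    impact: str
    owning_service: str
    timeline: list[TimelineEvent] = field(default_factory=list)


# Example record in the shape of a seeded incident (illustrative values).
demo = Incident(
    incident_id="inc_1001",
    title="prompt injection attempt",
    severity="high",
    impact="policy bypass risk",
    owning_service="agent-runtime-control-tower",
    timeline=[TimelineEvent("retrieval", "untrusted document entered context")],
)
```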
Endpoints:

```
GET /health
GET /teams
GET /playbooks
GET /incidents
GET /incidents/{incident_id}
GET /incidents/{incident_id}/timeline
GET /incidents/{incident_id}/evidence
GET /incidents/{incident_id}/mitigations
GET /incidents/{incident_id}/replay-preview
```
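Behind a route like `GET /incidents/{incident_id}`, the handler logic reduces to a lookup over the seeded store. The sketch below is a minimal stand-in with an assumed in-memory dict; the real service's store and error handling may differ:

```python
# Hypothetical in-memory store standing in for the seeded incident data.
INCIDENTS = {
    "inc_1001": {"title": "prompt injection attempt", "severity": "high"},
    "inc_1002": {"title": "hallucinated SQL draft", "severity": "medium"},
}


def get_incident(incident_id: str) -> dict:
    """Return the incident record, or a 404-style payload when it is unknown.

    In a FastAPI route this branch would instead
    `raise HTTPException(status_code=404, ...)`.
    """
    incident = INCIDENTS.get(incident_id)
    if incident is None:
        return {"error": "incident not found", "status": 404}
    return {"incident_id": incident_id, **incident}
```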
Run locally:

```
python -m pip install -e .
python -m uvicorn llm_incident_review_console.main:app --reload
```

Open:

- http://127.0.0.1:8000/dashboard
- http://127.0.0.1:8000/incidents
Seeded incidents:

- `inc_1001`: prompt injection attempt in `agent-runtime-control-tower`
- `inc_1002`: hallucinated SQL draft in `danex-rag-service`
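A replay preview amounts to planning a re-run without executing anything live. The sketch below shows one way that could look; the mode names (`dry_run`, `redacted`) and redaction rule are assumptions for illustration, not the project's actual behavior:

```python
def replay_preview(incident: dict, mode: str = "dry_run") -> dict:
    """Build a preview of a replay without calling any live tools.

    Assumed modes: 'dry_run' (no side effects) and
    'redacted' (evidence masked before display).
    """
    if mode not in {"dry_run", "redacted"}:
        raise ValueError(f"unknown replay mode: {mode}")
    evidence = incident.get("evidence", [])
    if mode == "redacted":
        evidence = ["[redacted]" for _ in evidence]
    return {
        "incident_id": incident["incident_id"],
        "mode": mode,
        "would_execute_tools": False,  # previews never hit live tools
        "evidence": evidence,
    }
```

The point of the preview step is that a reviewer sees exactly what a replay would do before anything is re-run.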
What you can inspect immediately:

- dashboard proof: `output/playwright/screen-01-dashboard.png`
- health proof: `output/playwright/screen-02-health-proof.png`
- operations proof: `output/playwright/screen-03-ops-proof.png`
- product framing: `output/playwright/screen-04-product-proof.png`
- architecture notes: `docs/ARCHITECTURE.md`
- case study: `docs/CASE_STUDY.md`
What this demonstrates:

- the backend can model post-incident AI forensics, not only request-time inference
- seeded evidence is grouped into timeline, mitigation, and replay surfaces that a reviewer can inspect quickly
- the project has a live dashboard, Docker packaging, CI, and screenshot-ready artifacts instead of README-only claims
This is not another chatbot demo. It is incident forensics for AI systems: the layer between "something went wrong" and "we understand why it went wrong and how to replay safely."