Popular repositories
- ai-career-coach (Public, TypeScript): Resume-grounded AI career coach. Multi-agent RAG with a preregistered, falsifiable LLM-as-judge eval benchmark and adversarial red-teaming.
- plimsoll (Public, Python): Deterministic, zero-dependency CLI that catches AI-agent regressions in CI from recorded traces: policy + baseline checks, no LLM judge, no account.
- toffoli (Public, TypeScript): The undo layer for AI agents. Classifies each agent action as reversible, compensable, or irreversible, plans the restitution, and escalates only what truly can't be undone. Measured as an eval.
- pacioli (Public, TypeScript): Double-entry bookkeeping for AI agents. Reconcile what your agent claimed against what the evidence shows, and get a receipt.
- coehoorn (Public, Python): Adversarial red-teaming for chat and tool-using agents, with every failure cited to the turn that proves it.