benefind

Human-auditable screening pipeline for tax-exempt nonprofits, combining deterministic methods with selective LLM verification to produce precise, explainable, reproducible decisions.

Warning

Work in progress: benefind is currently under active development. Data formats, scoring logic, and CLI commands may change as we iterate.

Built by Verein für Menschen to support beneficiary partner selection for Höhenmeter für Menschen, a charity run in Winterthur.

The current workflow is tailored to Swiss public-source nonprofit screening and can be adapted to similar decision-support contexts.

What it does

benefind takes the official Canton Zurich list of tax-exempt nonprofit organizations and:

Parses the PDF into structured data
Filters to organizations in Bezirk Winterthur
Discovers each organization's website via search
Enriches from ZEFIX (UID, legal form, status, purpose) and supports focused manual ZEFIX review
Guesses legal form from organization names when ZEFIX has no match
Prepares and reviews scrape readiness so that only safe URL targets are scraped
Scrapes key pages (respecting robots.txt)
Reviews scrape quality, then cleans duplicate intra-org content segments
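
As an illustration of the "respecting robots.txt" step above, here is a minimal sketch using Python's standard-library urllib.robotparser. It is not taken from the benefind source: the function name is_allowed, the user agent string "benefind-example", and the sample rules and URLs are all assumptions made for this example.

```python
from urllib.robotparser import RobotFileParser


def is_allowed(robots_txt: str, url: str, user_agent: str = "benefind-example") -> bool:
    """Return True if the given robots.txt text permits user_agent to fetch url.

    Hypothetical helper for illustration; the user agent is an assumption.
    """
    parser = RobotFileParser()
    # parse() takes the file content as an iterable of lines, so no network
    # access is needed once the robots.txt text has been downloaded.
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch(user_agent, url)


# Invented sample rules, not from any real organization's site.
ROBOTS = """\
User-agent: *
Disallow: /intern/
"""

print(is_allowed(ROBOTS, "https://example.org/ueber-uns"))         # True
print(is_allowed(ROBOTS, "https://example.org/intern/protokoll"))  # False
```

Checking each candidate URL this way before fetching keeps the scrape step deterministic and easy to audit: the robots.txt text that drove the decision can be stored alongside the result.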

Wherever uncertainty arises, items are flagged for manual review rather than silently decided.
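
One way to picture "flagged rather than silently decided" is the name-based legal-form guess mentioned in the list above. The sketch below is an assumption-laden illustration, not benefind's actual logic: the keyword table, the Guess dataclass, and the guess_legal_form function are hypothetical, and the organization names are invented.

```python
from dataclasses import dataclass
from typing import Optional

# Hypothetical keyword table; the real pipeline's rules are not shown here.
LEGAL_FORM_KEYWORDS = {
    "Stiftung": ("stiftung", "fondation"),
    "Verein": ("verein", "association"),
    "Genossenschaft": ("genossenschaft",),
}


@dataclass
class Guess:
    legal_form: Optional[str]  # None when no confident guess is possible
    needs_review: bool         # True when a human should decide
    reason: str                # short explanation kept for auditability


def guess_legal_form(name: str) -> Guess:
    """Guess a legal form from the name, flagging anything ambiguous."""
    lowered = name.lower()
    matches = [
        form
        for form, keywords in LEGAL_FORM_KEYWORDS.items()
        if any(keyword in lowered for keyword in keywords)
    ]
    if len(matches) == 1:
        return Guess(matches[0], False, f"name contains a keyword for {matches[0]}")
    if not matches:
        return Guess(None, True, "no legal-form keyword in name")
    # Several forms matched: surface the conflict instead of picking one.
    return Guess(None, True, "conflicting keywords: " + ", ".join(matches))


for example_name in ("Stiftung Sonnenhof", "Förderverein der Stiftung Beispiel", "Pro Beispiel Winterthur"):
    print(example_name, "->", guess_legal_form(example_name))
```

Only the unambiguous single-keyword case is decided automatically; both "no match" and "conflicting matches" return needs_review=True together with a reason a reviewer can read.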

This project is decision support, not automatic judgment. It favors conservative, inspectable steps over opaque end-to-end prompting: uncertain cases are surfaced for human review, and automated decisions are backed by saved evidence and metadata.
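
To make "backed by saved evidence and metadata" concrete, the following sketch shows what a minimal evidence record could look like. The field names, the build_evidence_record function, and the sample values are assumptions for illustration; benefind's real data formats may differ and are still changing.

```python
import hashlib
import json
from datetime import datetime, timezone


def build_evidence_record(org_id: str, step: str, decision: str,
                          method: str, source_text: str) -> dict:
    """Bundle a decision with enough metadata to audit it later.

    Hypothetical schema: every field name here is an assumption.
    """
    return {
        "org_id": org_id,
        "step": step,
        "decision": decision,
        "method": method,  # e.g. "deterministic" or "llm"
        # Hash of the exact text the decision was based on, so a reviewer
        # can confirm the stored evidence has not changed since.
        "source_sha256": hashlib.sha256(source_text.encode("utf-8")).hexdigest(),
        "recorded_at": datetime.now(timezone.utc).isoformat(),
    }


record = build_evidence_record(
    org_id="example-001",
    step="legal_form_guess",
    decision="needs_review",
    method="deterministic",
    source_text="Förderverein der Stiftung Beispiel",
)
print(json.dumps(record, indent=2, ensure_ascii=False))
```

Storing a content hash and a timestamp next to each decision lets a reviewer reproduce the decision from the same input and see which step and method produced it.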

Why this project matters

Finding suitable charity partners manually is time-consuming. benefind helps the team:

reduce repetitive screening work
keep decisions transparent and reviewable
focus human attention on ambiguous cases
move from raw public data to an actionable shortlist

The goal is practical decision support for high-stakes shortlisting tasks where accuracy and auditability matter more than fully automatic throughput.

Documentation

Current status

This repository is in an active iteration phase.

some heuristics are intentionally conservative
manual review is a first-class step, not an exception
subset-first iteration is supported (benefind subset, followed by incremental benefind extend)
prompts and thresholds are still being tuned with real-world examples
docs and developer ergonomics are actively being improved
implementation choices prioritize reliable outcomes and auditability over general-purpose abstraction

License

GPL-3.0
