OpenReview pilot benchmark (ICLR 2025) + eval tooling#65

Open
jwang1230 wants to merge 1 commit into ChicagoHAI:main from jwang1230:openreview-benchmark

@jwang1230

Adds benchmarks/openreview_benchmark: a 10-paper ICLR 2025 pilot JSONL, scripts (collect/normalize/filter/PDFs/validate/eval), OPENREVIEW.md and REPORT.md, a locked eval in reports/, eval_history.jsonl, and src/reviewer/evaluate_openreview.py for LLM-judge precision/recall/F1 (P/R/F1). The .gitignore now excludes the local openreview_raw, pdfs, and results/ paths.

- Pilot JSONL, scripts, docs, locked eval report and eval_history

- evaluate_openreview.py; .gitignore for local OpenReview paths
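
For readers unfamiliar with the P/R/F1 metric named above, here is a minimal sketch of how precision, recall, and F1 could be derived from LLM-judge match counts. This is an illustration only, not the code in evaluate_openreview.py: the `JudgeCounts` fields and the `precision_recall_f1` helper are hypothetical names, and it assumes the judge reports how many generated review points match a human point and how many human points are covered.

```python
from dataclasses import dataclass


@dataclass
class JudgeCounts:
    """Hypothetical per-paper tallies from an LLM judge (names are assumptions)."""
    predicted_total: int     # review points produced by the system
    predicted_matched: int   # of those, points the judge matched to a human point
    reference_total: int     # review points in the human (OpenReview) reviews
    reference_covered: int   # of those, points the judge found covered


def precision_recall_f1(c: JudgeCounts) -> tuple[float, float, float]:
    # Guard against empty sets so a paper with no points scores 0 instead of crashing.
    precision = c.predicted_matched / c.predicted_total if c.predicted_total else 0.0
    recall = c.reference_covered / c.reference_total if c.reference_total else 0.0
    # F1 is the harmonic mean of precision and recall.
    denom = precision + recall
    f1 = 2 * precision * recall / denom if denom else 0.0
    return precision, recall, f1


if __name__ == "__main__":
    # Made-up counts, purely for illustration.
    counts = JudgeCounts(predicted_total=10, predicted_matched=6,
                         reference_total=8, reference_covered=4)
    p, r, f1 = precision_recall_f1(counts)
    print(f"P={p:.3f} R={r:.3f} F1={f1:.3f}")
```

Keeping separate matched and covered counts allows for several generated points mapping to the same human point. Whether the PR's evaluator makes that distinction is not stated here.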