feat: initial import #1

Merged: jm-cnaught merged 1 commit into main from initial-import on May 20, 2026

@jm-cnaught
Collaborator

Summary

Single-commit initial import of the AI coding agent adoption study. Repo is private; will be flipped to public after final review.

The previous internal working repo (CNaught-Inc/coding-agent-adoption) stays private for ongoing internal work — this is a clean snapshot built from the working tree of that repo's chore/public-release-prep branch, with no history bleed (no prior commits, no deleted files reachable via git log).

What's in this repo

Pipeline (9 scripts):

  • github_ai_daily.py — daily commit-attribution fetcher (GitHub Search API).
  • fetch_branch_activity.py, fetch_branch_creates.py, fetch_daily_totals.py — BigQuery fetchers for GH Archive signals.
  • estimate_carbon.py — Jegham-framework carbon/energy estimation.
  • dashboard.py — Plotly HTML dashboard.
  • run_pipeline.py — orchestrator (daily/weekly modes, Slack alerting, git commit-back).
  • anomaly_analysis.py — signature-drift scanner.
  • build_carbon_workbook.py — Excel QA workbook.
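
As a rough illustration of what a daily commit-attribution query against the GitHub Search API could look like, here is a minimal sketch. It is not taken from github_ai_daily.py: the function name `build_daily_query` and the signature string "Co-Authored-By: Claude" are hypothetical, chosen only for the example. It relies on the commit search endpoint and its `author-date` qualifier, and builds the URL without making a network call.

```python
from datetime import date
from urllib.parse import urlencode

# GitHub REST endpoint for commit search.
SEARCH_COMMITS_URL = "https://api.github.com/search/commits"


def build_daily_query(signature: str, day: date) -> str:
    """Build a commit-search URL for commits matching `signature` on `day`.

    Hypothetical helper for illustration; the real fetcher may differ.
    """
    # Quote the signature so it is searched as a phrase, and restrict
    # results to a single authoring day with the author-date qualifier.
    q = f'"{signature}" author-date:{day.isoformat()}'
    # per_page=1 keeps the payload small when only the total count matters.
    return f"{SEARCH_COMMITS_URL}?{urlencode({'q': q, 'per_page': 1})}"


# Assumed example signature; the study's actual tool signatures are not shown here.
url = build_daily_query("Co-Authored-By: Claude", date(2025, 10, 20))
print(url)
```

A caller would send this request with an authenticated token and read the `total_count` field of the JSON response to get the day's count for that signature.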

Intervention analysis (10 modules):

  • Primary estimators: event-window CAR, BEAST changepoint detection, first-difference ITS.
  • Robustness: BSTS, synthetic DiD, compositional VAR, 3-param logistic fits.
  • Worked example: Oct 20, 2025 Claude Code web launch.
  • Vendored synthdid (PyPI build broken on Python 3.14).
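
For readers unfamiliar with the event-window CAR estimator, the sketch below shows the textbook constant-mean version: the expected daily count is the mean of a pre-event estimation window, abnormal counts are actual minus expected, and the CAR is their sum over the event window. This is an assumption about the general technique, not the repo's implementation; the function name, window lengths, and toy numbers are all hypothetical.

```python
import math
import statistics


def event_window_car(series, event_idx, est_len, win_len):
    """Cumulative abnormal count and z-score under a constant-mean baseline.

    Hypothetical helper: the baseline model used in the study may differ.
    """
    if event_idx - est_len < 0:
        raise ValueError("estimation window extends before the series start")
    baseline = series[event_idx - est_len:event_idx]
    window = series[event_idx:event_idx + win_len]

    mu = statistics.fmean(baseline)      # expected daily count
    sigma = statistics.stdev(baseline)   # sample std of the baseline days

    # Sum of (actual - expected) over the event window.
    car = sum(x - mu for x in window)
    # Under independent daily noise, the CAR has std sigma * sqrt(window length).
    z = car / (sigma * math.sqrt(len(window)))
    return car, z


# Toy data: 8 baseline days around 100 commits, then 3 elevated days.
toy = [100, 102, 98, 101, 99, 100, 103, 97, 110, 112, 108]
car, z = event_window_car(toy, event_idx=8, est_len=8, win_len=3)
print(f"CAR = {car:.0f}, z = {z:.2f}")  # CAR = 30, z = 8.66
```

The z-score here assumes uncorrelated daily residuals; real commit series are autocorrelated and have weekly seasonality, which is one reason to pair the estimate with the robustness methods listed above.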

Data (7 CSVs, ~2 MB):

  • ~450 days of commit data across 32 tools.
  • ~392 days of BigQuery branch/push signals.
  • Bot-donor pool for SDID.
  • Derived carbon estimates.

Documentation:

  • README.md — three no-credential quickstarts + full credentialed setup.
  • AGENTS.md — operating manual for AI coding agents (file map, autonomous walkthrough, walk-the-user-through-it mode, token-availability cheat sheet, guardrails, footguns).
  • METHODOLOGY.md — full methodology, signal definitions, caveats.
  • CONTRIBUTING.md, LICENSE (Apache 2.0), .env.example.
  • references/ — 9 cited academic papers.

Tests: 95-test pytest suite, runs in ~4s, GitHub Actions CI configured.

Test plan

  • python3 -m pytest tests/ -v — 95 passed
  • python3 dashboard.py builds the dashboard from shipped CSVs (464 days, 30 tools, ~1.6 MB output)
  • python3 run_intervention_analysis.py produces outputs/intervention/ with 12 artifacts; headline CAR = 60,067 commits, z = 5.59
  • Secret scan against all tracked files — clean
  • Dogfood test passed: a fresh subagent constrained to only the public docs successfully built the dashboard, ran the intervention analysis, and ran the full test suite

Before flipping to public

  • Decide whether to leave the repo name as-is (coding-agent-emissions) or pick something else.
  • Add GH_SEARCH_TOKEN, GCP_SA_KEY, optional SLACK_WEBHOOK_URL as repo secrets if scheduled CI should run after publication.
  • Verify the .github/workflows/ schedules are in the state you want before public.
  • Flip visibility: Settings → General → Danger Zone.
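
A pre-flight check along these lines could confirm which credentials are present before a scheduled run. It is only a sketch: the function `check_credentials` is hypothetical and not part of the repo, and the required/optional split simply follows the secret names listed above.

```python
import os

# Secret names as listed in the checklist above.
REQUIRED = ("GH_SEARCH_TOKEN", "GCP_SA_KEY")
OPTIONAL = ("SLACK_WEBHOOK_URL",)


def check_credentials(env=None):
    """Report which credentials are missing (hypothetical helper)."""
    env = os.environ if env is None else env
    # Treat unset and empty values the same way.
    missing = [name for name in REQUIRED if not env.get(name)]
    optional_missing = [name for name in OPTIONAL if not env.get(name)]
    return {
        "ok": not missing,
        "missing": missing,
        "optional_missing": optional_missing,
    }


# Example with an explicit mapping instead of the real environment.
report = check_credentials({"GH_SEARCH_TOKEN": "example-token"})
print(report)
```

Passing a plain dict keeps the check testable without touching the real environment; in CI the default `os.environ` would be used.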

🤖 Generated with Claude Code

Multi-signal estimate of AI coding agent adoption from public GitHub data.

Includes:
- Core pipeline: github_ai_daily.py, three BigQuery fetchers, estimate_carbon.py,
  dashboard.py, run_pipeline.py, anomaly_analysis.py, build_carbon_workbook.py.
- Intervention analysis: event-window CAR (primary), BEAST changepoint, first-diff
  ITS, BSTS, synthetic DiD; vendored synthdid.py; worked example around the
  Oct 20, 2025 Claude Code web launch.
- 7 shipped CSVs (~450 days commits, ~392 days BigQuery signals, donor pool).
- Documentation: README.md, AGENTS.md (operating manual for AI agents),
  CLAUDE.md (imports AGENTS.md so Claude Code picks up the same instructions),
  METHODOLOGY.md, CONTRIBUTING.md, .env.example.
- 95-test pytest suite + .github/workflows/ for daily/weekly/CI.
- 9 reference PDFs of cited academic papers.

License: MIT.
@jm-cnaught merged commit 7e59078 into main on May 20, 2026
2 checks passed