Skip to content

Multi-Agent Debate and Agent Forest pipeline variants#2

Draft
tamara-kostova wants to merge 8 commits into
mainfrom
deep-research
Draft

Multi-Agent Debate and Agent Forest pipeline variants#2
tamara-kostova wants to merge 8 commits into
mainfrom
deep-research

Conversation

@tamara-kostova

@tamara-kostova tamara-kostova commented Apr 21, 2026

Copy link
Copy Markdown
Owner
  • System B - Multi-Agent Debate (--pipeline_mode debate): three MedGemma advocate agents argue on behalf of CNN, BiomedCLIP, and SAM3 outputs; a fourth MedGemma instance judges. Supports 1–3 rounds where advocates see the prior verdict and counter-argue. Replaces the verification + report tail.
  • System C - Agent Forest (--pipeline_mode forest): N role-specialized MedGemma instances (radiologist, conservative, emergency, differential) independently diagnose the scan; majority vote + confidence-weighted tiebreaking produces the consensus routing decision. Replaces the single triage node; all downstream nodes unchanged.
  • Two new research sweep families: agent_forest (N ∈ {1, 3, 4}) and debate_rounds (R ∈ {1, 2, 3}).
  • Two new analysis functions: forest_voting_analysis (dissent rate vs. accuracy) and debate_round_analysis (verdict stability vs. ECE).
  • evaluate.py extended to capture dissent_rate, vote_fraction, debate_rounds_completed, debate_round_changed per sample.

@tamara-kostova tamara-kostova changed the title Deep Research Multi-Agent Debate and Agent Forest pipeline variants Jun 2, 2026
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant