feat(pts): wire aact_extraction_batch_results into chembl_molecule (via pis_drug)#207

Merged
DSuveges merged 4 commits into 2606-public-release from chembl-molecule-aact-source on Jun 11, 2026

Conversation

@d0choa (Collaborator) commented Jun 11, 2026

Summary

Propagates the PTS chembl_molecule config change (opentargets/pts#142, tracking issue opentargets/issues#4414) into orchestration, with the AACT batch downloaded in pis_drug so the DAG dependencies stay unchanged.

  • pts.yaml: the chembl_molecule step gains the source aact_extraction_batch_results: input/clinical_report/aact_extraction_batch_results (mirrors the PTS config.yaml change).
  • pis.yaml: mirrors opentargets/pis#202 ("feat(drug): download aact_extraction_batch_results in the drug step"). The drug step uses copy_many to copy the AACT batch from its standalone source gs://ot-team/irene/clinical_mining/aact_extraction_batch_result/output/* to input/clinical_report/aact_extraction_batch_results/. The clinical_report step is unchanged.
  • unified_pipeline.yaml: no change. The AACT input is staged by pis_drug, which pts_chembl_molecule already depends on.
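
The reasoning behind "no change" to unified_pipeline.yaml can be sketched as a small check: every input a step reads should be staged by a step that is already upstream of it. The step names and the input path below come from this PR; the dict layout, the `ancestors` and `unstaged_inputs` helpers, and the empty dependency lists for the PIS steps are assumptions made for illustration and do not reflect the real unified_pipeline.yaml schema.

```python
# Hypothetical model of the step graph. Step names are taken from the PR text;
# the structure is an assumption, not the unified_pipeline.yaml format.
DEPENDS_ON = {
    "pis_drug": [],
    "pis_clinical_report": [],
    "pts_chembl_molecule": ["pis_drug"],
}

# Which step stages which input path once this PR is applied.
STAGED_BY = {
    "input/clinical_report/aact_extraction_batch_results": "pis_drug",
}

# Inputs each step reads (only the newly added source is listed).
READS = {
    "pts_chembl_molecule": ["input/clinical_report/aact_extraction_batch_results"],
}


def ancestors(step, graph):
    """Return every step that must finish before `step` can run."""
    seen = set()
    stack = list(graph.get(step, []))
    while stack:
        current = stack.pop()
        if current not in seen:
            seen.add(current)
            stack.extend(graph.get(current, []))
    return seen


def unstaged_inputs(step):
    """List inputs of `step` whose staging step is not upstream of it."""
    upstream = ancestors(step, DEPENDS_ON)
    return [
        path
        for path in READS.get(step, [])
        if STAGED_BY.get(path) not in upstream
    ]


print(unstaged_inputs("pts_chembl_molecule"))  # []
```

In this model, changing the `STAGED_BY` entry to `"pis_clinical_report"` makes the same call return the AACT path, which is the case where an extra DAG edge would be needed.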

Based on 2606-public-release (PR #201), not dev.

Test plan

  • pytest tests/test_dag_validation.py — 6 passed
  • Net diff is pts.yaml + pis.yaml only

Related

Mirror the PTS config change (opentargets/pts#142): the chembl_molecule step
now reads input/clinical_report/aact_extraction_batch_results to mine
clinical-trial (AACT) synonyms. That input is staged by pis_clinical_report, so
pts_chembl_molecule now also depends on it in the unified pipeline (otherwise
the DAG could run chembl_molecule before the AACT batch is present).
…_molecule dep

Mirror opentargets/pis#202: download aact_extraction_batch_results in the PIS
drug step (clinical_report glob split into top-level / aact / chembl subtrees to
exclude it). Since pts_chembl_molecule already depends on pis_drug, revert the
earlier pts_chembl_molecule -> pis_clinical_report edge — the DAG dependencies
stay as they were.
@d0choa changed the title from "feat(pts): wire aact_extraction_batch_results into chembl_molecule" to "feat(pts): wire aact_extraction_batch_results into chembl_molecule (via pis_drug)" on Jun 11, 2026
@DSuveges merged commit 148ac76 into 2606-public-release on Jun 11, 2026
2 checks passed
@DSuveges deleted the chembl-molecule-aact-source branch on June 11, 2026 13:24