- DB queries live in
src/poppy/io/db.pyand are configured inconfigs/ontology.yaml. - CSV loaders live in
src/poppy/io/csvs.pyand are configured inconfigs/ontology.yaml. - rdflib enrichment functions live in
src/poppy/ontology/enrich.py. - The orchestrated build is
src/poppy/ontology/pipeline.pywith CLI inscripts/build_ontology.py.
-
CMAUP:
CMAUPv2.0_download_Plant_Ingredient_Associations_allIngredients.txt→ renamePlant_ID→plant_id,Ingredient_ID→compound_idCMAUPv2.0_download_Plants.txt→ renamePlant_ID→plant_id,Scientific_Name→scientific_nameCMAUPv2.0_download_Ingredient_Target_Associations_ActivityValues_References.txt→Ingredient_ID→compound_id,Target_ID→gene_id
-
NPASS:
NPASSv1.0_download_naturalProducts_targetInfo.txt→NP_ID→compound_id,Target→gene_id -
ChEMBL mechanisms:
ChEMBL_drug_mechanisms.csv→molecule_chembl_id→compound_id,target_chembl_id→gene_id -
COCONUT (SQL examples):
SELECT id AS plant_id, name AS scientific_name FROM coconut.organisms;- A compound join across
coconut.properties,coconut.entries,coconut.structures,coconut.organism_map
python -m venv .venv && source .venv/bin/activate
pip install -e ".[dev]"
python scripts/build_ontology.py --config configs/ontology.yaml