- Long-term stable archive of model inpusts, model, database, and metadata
- Open-acess distribution point
ARTIS uses the The Knowledge Network for Biocomplexity KNB data repository to archive and distribute stable releases of the model codebase and resulting database. Archiving, documenting and openly distributing ARTIS is a critical component in contributing to the larger open-science and reproducible science community. ARTIS uses KNB as an access point for anyone to download the ARTIS model codebase and clean inputs and output ARTIS database
KNB is guided by FAIR (findable, accessible, interoperable, resuble) principles of data sharing and preservation and issues unique DOIs (digital object identifier) to each data package and every version of the package for long term access, transparency, and informative citations.
KNB is a member of DataONE (Data Observation Network for Earth); a network of data repositories. KNB uses EML (Ecological Metadata Language) to document objects within a data packages which can be created via the website GUI (graphical user interface) or through a series of R packages rdataone and arcticdatautils.
Note: a user will need an ORCiD to log into KNB.
For additional resources on how to submit data to KNB see the links below from NCEAS (National Center for Ecological Analysis and Synthesis) who develops and maintains DataONE and KNB.
This repo contains scripts to semi-automatically create EML formatted metadata to accompany the data submission to the KNB data repository.
FIXIT: Add more instructions about running these scripts and what the specific files do.
\
- Remove /
.gitignore.metadata/attributers_*.txtand.metdata/catvars_*.txtfiles.- Can easily be recreated in the
run_EMLassemblyline_for_metadata-files.R
- Can easily be recreated in the
- Keep
.metadata/abstract.md,.metadata/additional_info.md,.metadata/intellectual_rights.txt,.metadata/keywords.txt,.metadata/methods.md, and.metadata/personnel.txt.- Contain values that should be carried over to next release version.
EMLassemblylinedoes not work for parquet files. Added workaround inrun_EMLassemblyline_for_metadata-files.R(EAL) to convert a sample of ARTIS KNB files into .csv to run through theEALworkflow to generate first valid iteration of the EML doc. Probably need some "post-processing" EML.xml edits to correct things like EML<physical>elements.
- Missing values
""forcategoricalclass data attributes don't seem to validate when callingEMLassemblyline::make_eml(). I retained the definitions inartis_data_dictionary_attributes_catvars.txtbut will assignNAas the missing value personnel.txtrolecolumn does not take EML specific values for theResearchProjectType/personnel/role. It seems likeEMLassemblyline::make_eml()translates the valid values outlined in EAL DIY instructions and translates them to valid EMLResearchProjectType/personnel/rolevalues.
<physical>reads from.csvfiles in./metadata-files/ data_objects/- Might need work around to accurately represent.parquetfiles