This repository contains scripts (typically in the Python programming language) that are used to extract text from document originating from various sources.
This United States Environmental Protection Agency (EPA) GitHub project code is provided on an "as is" basis and the user assumes responsibility for its use. With respect to documents available from this repository, neither the United States Government nor any of their employees, makes any warranty, express or implied, including the warranties of merchantability and fitness for a particular purpose, or assumes any legal liability or responsibility for the accuracy, completeness, or usefulness of any information, apparatus, product, or process disclosed, or represents that its use would not infringe privately owned rights.
Reference herein to any specific commercial products, process, or service by trade name, trademark, manufacturer, or otherwise, does not necessarily constitute or imply its endorsement, recommendation, or favoring by the United States Government. The views and opinions of the developers of the site expressed herein do not necessarily state or reflect those of the United States Government, and shall not be used for advertising or product endorsement purposes.
Efforts related post-extraction curation, quality assurance processes, and data obtained from code in this repository are detailed in Handa et al. Scientific Data. The database resulting from these scripts is the Chemicals and Products Database (CPDat) and may be accessed via Figshare (https://doi.org/10.23645/epacomptox.5352997.v5) or the Chemical Exposure (ChemExpo) Knowledgebase (https://comptox.epa.gov/chemexpo) The files included herein do not represent and should not be construed to represent any Agency determination or policy.
The work reported here was funded by the United States Environmental Protection Agency, in part under Contract CIO-SP3, HHSN316201200013W to General Dynamics Information Technology, Inc and contract 68HERH20D0003 to Oak Ridge Associated Universities.