A tool for extracting and analyzing clinical trial data using Large Language Models.
This project extracts structured information from clinical trials using LLMs, focusing on key parameters and outcomes. The pipeline includes data acquisition, filtering, LLM-based information extraction, and statistical analysis.
git clone https://github.com/yourusername/ClinicalTrialLLM-Extractor.git
cd ClinicalTrialLLM-Extractor
pip install -r requirements.txtdata/- Raw and processed datasetsraw/- Original clinical trial datafiltered/- Filtered datasetsprocessed/- LLM-processed data
notebooks/- Jupyter notebooks for pipeline stagesfigures/- Generated visualizationsresults/- Final analysis outputs
- Data acquisition:
jupyter notebook notebooks/01_dataset_pull.ipynb - Data filtering:
jupyter notebook notebooks/02_simple_filtering.ipynb - LLM processing:
jupyter notebook notebooks/03_LLM_processor.ipynb - Analysis:
jupyter notebook notebooks/paper_stats.ipynb
This project is licensed under the MIT License - see the LICENSE file for details.