A curated list of open-source tools, papers, datasets, and frameworks for building AI/LLM applications in regulated industries.
Building AI that actually works when compliance matters.
- RAG for Regulatory Documents
- Compliance & Risk
- Financial Services
- Healthcare
- Evaluation & Testing
- Papers
- RegAI - RAG pipeline for SEBI/RBI circulars with regulatory-aware chunking
- sec-insights - LlamaIndex-powered RAG over SEC filings
- chatlaw - LLM for Chinese legal domain with retrieval augmentation
- lexpredict-lexnlp - NLP library for legal and regulatory text
- guardrails - Input/output guards for LLM applications
- nemo-guardrails - Programmable guardrails for LLM conversational systems
- rebuff - Prompt injection detection framework
- finrobot - Open-source AI agent for financial analysis
- fingpt - Open-source financial LLMs
- BloombergGPT - 50B parameter LLM trained on financial data (paper)
- MedPaLM - Google's medical domain LLM (paper)
- open-biomedical - Open-source biomedical AI toolkit
- clinical-bert - BERT models pre-trained on clinical notes
- deepeval - LLM evaluation framework with regulatory-relevant metrics
- ragas - RAG evaluation framework
- promptfoo - LLM output testing and evaluation
- Retrieval-Augmented Generation for Knowledge-Intensive NLP Tasks - The original RAG paper
- Regulatory Compliance through Doc-RAG - RAG for regulatory compliance
PRs welcome. If you're building AI for regulated industries, open an issue or submit a link.
CC0 1.0 Universal