14 Mar 16:45

tradertanmay

dfe180c

v0.2.0: The Polishing Release

TanML v0.2.0: The Polishing Release

This release focuses on industrial-grade refinements, documentation clarity, and professional presentation.

✨ Key Improvements

Branding Consistency: Corrected capitalization for Scikit-learn, Statsmodels, and YData Profiling across all files.
Refined Terminology: Updated workflow steps to Data Preprocessing and Feature Power Ranking for better UI/Docs alignment.
Audit Ready: Removed the hyphen from "audit ready" and clarified the report generation descriptions.
Header Revamp: Tightened README header spacing and added a new OS Compatibility Badge.
Scientific Reproducibility: Added scripts/generate_demo_data.py with fixed seeds to ensure consistent validation results.
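The contents of scripts/generate_demo_data.py are not shown in these notes. As a rough illustration of what "fixed seeds" buys you, the sketch below builds synthetic rows from a locally seeded generator. The column names, distributions, and target rule are all hypothetical and are not taken from the actual script.

```python
import random


def generate_demo_rows(n_rows=100, seed=42):
    """Return reproducible synthetic rows for a binary classification demo."""
    # A local generator keeps the seed from leaking into global random state.
    rng = random.Random(seed)
    rows = []
    for _ in range(n_rows):
        income = round(rng.gauss(50_000, 15_000), 2)
        age = rng.randint(18, 80)
        # Hypothetical target rule plus noise, purely for illustration.
        default = int(income < 40_000 and rng.random() < 0.7)
        rows.append({"income": income, "age": age, "default": default})
    return rows
```

Calling generate_demo_rows twice with the same seed yields identical rows, so any validation run on that data starts from the same input.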

🛠️ Internal Cleanup

Removed tanml_runs clutter and redundant scripts from the main branch.
Updated all GitHub Workflows to use Trusted Publishing and fixed dynamic version detection for documentation.
Cleared legacy Node.js deprecation warnings in CI/CD pipelines.

13 Jan 14:47

tradertanmay

v0.1.10

69e3f30

v0.1.10

Release 0.1.10: UCI support, Python 3.13 fixes, enhanced duplicate/ou…

23 Dec 16:22

tradertanmay

v0.1.8

333b853

v0.1.8 - Official Launch 🚀

Official Release of TanML v0.1.8

🌟 Key Changes

Consolidated Output: All runs now saved to tanml_runs/.
UI Refactoring: Improved modularity and navigation.
Documentation: Updated README.md and pyproject.toml.
Clean History: Removed large legacy files.

Enjoy!

What's Changed

Clean main by @tradertanmay in #5

Full Changelog: v0.1.7...v0.1.8

Contributors

tradertanmay

11 Oct 00:57

tradertanmay

v0.1.7

a5d052a

TanML v0.1.7 Beta

TanML: Automated Model Validation Toolkit for Tabular Machine Learning

TanML validates tabular ML models with a zero-config Streamlit UI and exports an audit-ready, editable Word report (.docx). It covers data quality, correlation/VIF, performance, explainability (SHAP), and robustness/stress tests—built for regulated settings (MRM, credit risk, insurance, etc.).

Status: Beta (0.x)
License: MIT
Python: 3.8–3.12
OS: Linux / macOS / Windows (incl. WSL)

Why TanML?
Install
Quick Start (UI)
What TanML Checks
Optional CLI Flags
Templates
Data Privacy
Troubleshooting
Contributing
License & Citation

Why TanML?

Zero-config UI: launch Streamlit, upload data, click Run—no YAML needed.
Audit-ready outputs: tables/plots + a polished DOCX your stakeholders can edit.
Regulatory alignment: supports common Model Risk Management themes (e.g., SR 11-7 style).
Works with your stack: scikit-learn, XGBoost/LightGBM/CatBoost, etc.

Install

pip install tanml

Quick Start (UI)

tanml ui

Opens at http://127.0.0.1:8501
Upload limit ~1 GB (preconfigured)
Telemetry disabled by default

In the app

Load data — upload a cleaned CSV/XLSX/Parquet (optional: raw or separate Train/Test).
Select target & features — target auto-suggested; features default to all non-target columns.
Pick a model — choose library/algorithm (scikit-learn, XGBoost, LightGBM, CatBoost) and tweak params.
Run validation — click ▶️ Refit & validate.
Export — click ⬇️ Download report to get a DOCX (auto-selects classification/regression template).

Outputs

Report: ./.ui_runs/<session>/tanml_report_*.docx
Artifacts (CSV/PNGs): ./.ui_runs/<session>/artifacts/*
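If you want to pick up the newest report from a script, something like the following may help. It assumes only the path layout listed above (./.ui_runs/<session>/tanml_report_*.docx). The helper name latest_report is made up for this example and is not part of TanML.

```python
from pathlib import Path


def latest_report(root=".ui_runs"):
    """Return the most recently modified tanml_report_*.docx under root, or None."""
    # One directory level per session, matching ./.ui_runs/<session>/...
    reports = Path(root).glob("*/tanml_report_*.docx")
    return max(reports, key=lambda p: p.stat().st_mtime, default=None)
```

The function returns None when no session has produced a report yet, so callers can check for that before opening the file.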

What TanML Checks

Raw Data (optional): rows/cols, missingness, duplicates, constant columns
Data Quality & EDA: summaries, distributions
Correlation & Multicollinearity: heatmap, top-pairs CSV, VIF table
Performance
- Classification: AUC, PR-AUC, KS, decile lift, confusion matrix
- Regression: R², MAE, MSE/RMSE, error stats
Explainability: SHAP (auto explainer; configurable background size)
Robustness/Stress Tests: feature perturbations → delta-metrics
Model Metadata: model class, hyperparameters, features, training info
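To make one of the classification metrics above concrete, here is a minimal, dependency-free sketch of the KS statistic: the largest gap between the cumulative score distributions of positives and negatives. This is a textbook formulation written for illustration. It is not TanML's implementation, and the function name is an assumption.

```python
def ks_statistic(y_true, y_score):
    """Max gap between the score CDFs of positives (1) and negatives (0)."""
    n_pos = sum(1 for y in y_true if y == 1)
    n_neg = len(y_true) - n_pos
    if n_pos == 0 or n_neg == 0:
        raise ValueError("need at least one positive and one negative label")
    pairs = sorted(zip(y_score, y_true))
    cum_pos = cum_neg = 0
    best = 0.0
    for i, (score, label) in enumerate(pairs):
        if label == 1:
            cum_pos += 1
        else:
            cum_neg += 1
        # Only evaluate the gap once all rows tied at this score are counted.
        if i + 1 == len(pairs) or pairs[i + 1][0] != score:
            best = max(best, abs(cum_pos / n_pos - cum_neg / n_neg))
    return best
```

A model that separates the classes perfectly scores 1.0, and one that assigns every row the same score gets 0.0.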

Optional CLI Flags

Most users just run tanml ui. These help on teams/servers:

# Share on LAN
tanml ui --public

# Different port
tanml ui --port 9000

# Headless (server/CI; no auto-open browser)
tanml ui --headless

# Larger limit (e.g., 2 GB)
tanml ui --max-mb 2048

Env var equivalents (Linux/macOS bash):

TANML_SERVER_ADDRESS=0.0.0.0 TANML_PORT=9000 TANML_MAX_MB=2048 tanml ui

Windows PowerShell:

$env:TANML_SERVER_ADDRESS="0.0.0.0"; $env:TANML_PORT="9000"; $env:TANML_MAX_MB="2048"; tanml ui

Defaults: address 127.0.0.1, port 8501, limit 1024 MB, telemetry OFF.
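The sketch below shows one way flags, environment variables, and defaults could be combined. The variable names and default values come from the text above. The precedence order (flag, then env var, then default) is an assumption and has not been checked against TanML's code, and resolve_settings is a hypothetical helper.

```python
import os

DEFAULTS = {"address": "127.0.0.1", "port": 8501, "max_mb": 1024}
ENV_NAMES = {
    "address": "TANML_SERVER_ADDRESS",
    "port": "TANML_PORT",
    "max_mb": "TANML_MAX_MB",
}


def resolve_settings(cli=None, env=None):
    """Merge settings: CLI value, else env var, else default (assumed order)."""
    cli = cli or {}
    env = os.environ if env is None else env
    resolved = {}
    for key, default in DEFAULTS.items():
        if cli.get(key) is not None:
            value = cli[key]
        elif ENV_NAMES[key] in env:
            value = env[ENV_NAMES[key]]
        else:
            value = default
        # Env vars arrive as strings; coerce to the default's type.
        resolved[key] = type(default)(value)
    return resolved
```

For example, resolve_settings(env={"TANML_PORT": "9000"}) gives port 9000 as an integer while leaving the address and upload limit at their defaults.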

Templates

TanML ships DOCX templates (packaged in wheel & sdist):

tanml/report/templates/report_template_cls.docx
tanml/report/templates/report_template_reg.docx

Data Privacy

TanML runs locally; no data is sent to external services.
Telemetry is disabled by default (and can be forced off via --no-telemetry).
UI artifacts and reports are written under ./.ui_runs/<session>/ in your working directory.

Troubleshooting

Page didn’t open? Visit http://127.0.0.1:8501 or run tanml ui --port 9000.
Large CSVs are slow/heavy? Prefer Parquet; CSV → DataFrame can use several GB RAM.
Artifacts missing? Check ./.ui_runs/<session>/artifacts/.
Corporate networks: use tanml ui --public to share on LAN.
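When the page does not open, one common cause is another process already listening on the port. The check below uses only the standard library and is independent of TanML. The function name is made up for this example.

```python
import socket


def port_is_free(host="127.0.0.1", port=8501):
    """Return True if nothing is accepting TCP connections on host:port."""
    with socket.socket(socket.AF_INET, socket.SOCK_STREAM) as sock:
        sock.settimeout(0.5)
        # connect_ex returns 0 when a listener accepted the connection.
        return sock.connect_ex((host, port)) != 0
```

If port_is_free() returns False before you start the UI, something else holds 8501, and tanml ui --port 9000 is a reasonable next step.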

Contributing

We welcome issues and PRs!

Create a virtual environment and install dev extras:
- python -m venv .venv && source .venv/bin/activate (or .venv\Scripts\activate on Windows)
- pip install -e ".[dev]"
Format/lint: black . && isort .
Run tests: pytest

Before opening a PR, please describe the change and include a brief test or reproduction steps where applicable.

License & Citation

License: MIT. See LICENSE.
SPDX-License-Identifier: MIT

How to cite

If TanML helps your work or publications, please cite:

Sah, T., & Sah, D. (2025). TanML: Automated Model Validation Toolkit for Tabular Machine Learning [Software]. Available at https://github.com/tdlabs-ai/tanml

Or in BibTeX (version-agnostic):

@misc{tanml,
  author = {Sah, Tanmay and Sah, Dolly},
  title  = {TanML: Automated Model Validation Toolkit for Tabular Machine Learning},
  year   = {2025},
  note   = {Software; MIT License},
  url    = {https://github.com/tdlabs-ai/tanml}
}

A machine-readable citation file (CITATION.cff) is included for citation tools and GitHub’s “Cite this repository” button.

Releases: tdlabs-ai/tanml
