nse-xbrl-parser

An open-source Python library to robustly parse National Stock Exchange (NSE) XBRL corporate announcements and automatically convert them into clean, human-readable JSON facts.

nse-xbrl-parser addresses the "Missing Schema" / "Missing XSD" problem. NSE XBRL filings often reference old taxonomy templates that have been removed from NSE's servers, or they import core schemas that are omitted from the category-specific ZIP downloads.

This library acts as a fully offline parsing engine. It resolves all of these references against an internal, hierarchical taxonomy archive. By temporarily placing the XML filing alongside the schema, it lets relative references to sub-dependencies resolve natively, so your production Docker containers stay clean and data is resolved with zero internet access.
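The co-location idea described above can be sketched roughly as follows. This is a minimal illustration and not the library's actual implementation: the helper name `stage_filing_with_taxonomy`, its parameters, and the directory layout are all hypothetical, and it assumes the bundled taxonomy is a plain directory tree on disk.

```python
import shutil
import tempfile
from contextlib import contextmanager
from pathlib import Path


@contextmanager
def stage_filing_with_taxonomy(filing: Path, taxonomy_root: Path, schema_dir: Path):
    """Yield a temporary copy of `filing` placed next to the entry-point schema.

    Hypothetical helper for illustration only.
    `taxonomy_root` is the bundled taxonomy tree; `schema_dir` is the path,
    relative to that root, of the folder holding the entry-point schema.
    """
    with tempfile.TemporaryDirectory() as tmp:
        staged_root = Path(tmp) / "taxonomy"
        # Copy the (possibly read-only) taxonomy tree into a scratch directory.
        shutil.copytree(taxonomy_root, staged_root)
        # Put the filing beside the schema so relative imports such as
        # "../core/types.xsd" resolve against the local filesystem.
        staged_filing = staged_root / schema_dir / filing.name
        shutil.copy2(filing, staged_filing)
        yield staged_filing
    # Leaving the `with` block deletes the scratch directory and its contents.
```

Because everything happens inside a temporary directory, neither the installed package nor the original filing is modified, which is one plausible way to stay compatible with read-only installations.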

🚀 Features

Offline Resolution: Comprehensive, hierarchical NSE taxonomies bundled. No internet required during the Arelle schema validation phase.
Robust Resolution Engine: Automatically handles complex import paths (e.g., ../core/types.xsd) by enforcing local filesystem resolution.
Read-Only Safe: Performs temporary operations in a sandbox; perfectly compliant with locked-down Python package installations.
Human-Readable Labels: Preferentially maps opaque XBRL QNames (e.g. ns:Management) back to their English labels (e.g. "Change in Management").
Agent/LLM Optimized: Strictly typed, fully documented, and silent (suppresses noisy stdout warnings emitted by processing engines). The JSON output is well suited to fundamental analysis bots.
Smart Versioning: A CI/CD pipeline regularly checks for new taxonomies and adds a new version only when the schemas of an existing family have actually changed or a new family has been added.
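As an illustration of the "only if the actual schemas have been updated" check, one common approach is to compare content fingerprints of the downloaded and bundled taxonomy folders. The sketch below assumes that approach; the function names `taxonomy_fingerprint` and `has_changed` are hypothetical and the project's real pipeline may work differently.

```python
import hashlib
from pathlib import Path


def taxonomy_fingerprint(root: Path) -> str:
    """Return a SHA-256 digest over every file under `root` (paths and bytes)."""
    digest = hashlib.sha256()
    files = [p for p in root.rglob("*") if p.is_file()]
    # Sort by relative path so the digest does not depend on traversal order.
    for path in sorted(files, key=lambda p: p.relative_to(root).as_posix()):
        # Hash the relative path too, so renames change the fingerprint.
        digest.update(path.relative_to(root).as_posix().encode("utf-8"))
        digest.update(b"\0")
        digest.update(path.read_bytes())
        digest.update(b"\0")
    return digest.hexdigest()


def has_changed(downloaded: Path, bundled: Path) -> bool:
    """True when `downloaded` is a new family or differs from `bundled`."""
    if not bundled.exists():
        return True  # no bundled copy yet: treat as a new family
    return taxonomy_fingerprint(downloaded) != taxonomy_fingerprint(bundled)
```

A re-download with identical schema bytes then yields the same fingerprint, so no new version would be cut.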

🛠️ Usage

Install with a standard Python package manager, for example uv:

uv add git+https://github.com/eggmasonvalue/nse-xbrl-parser.git

Then parse a downloaded *.xml filing in Python:

from nse_xbrl_parser import parse_xbrl_file
from pathlib import Path

# Provide the path to your downloaded instance document
filing_path = Path("SWIGGY_announcement_1.xml")

# The parser automatically detects the required schemas, performs validation,
# pulls the labels, and returns a clean dictionary of facts
facts = parse_xbrl_file(filing_path)

print(facts)
# Example output:
# {
#   "Reason for change": "Resignation",
#   "Date of appointment/cessation": "2024-05-15",
#   "Brief profile": "Jane Doe was leading..."
# }
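To persist the result as a JSON file rather than printing it, something like the following may help. It is a small sketch that assumes the returned facts form a plain, JSON-serializable dictionary as in the example output above; `save_facts` is a hypothetical helper, not part of the library.

```python
import json
from pathlib import Path
from typing import Any


def save_facts(facts: dict[str, Any], out_path: Path) -> Path:
    """Write `facts` to `out_path` as pretty-printed UTF-8 JSON."""
    # ensure_ascii=False keeps any non-ASCII label text readable in the file.
    text = json.dumps(facts, indent=2, ensure_ascii=False)
    out_path.write_text(text + "\n", encoding="utf-8")
    return out_path
```

For example, `save_facts(facts, Path("SWIGGY_announcement_1.json"))` would write the parsed facts next to your script.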

🏗️ Architecture

Refer to the .context/ living documentation directory for architectural specifics and design patterns enforcing this modular approach.
