Skip to content

madhurlak0810/SEC-edgar

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

38 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Financial Data Analytics Platform

Overview

The Financial Data Analytics Platform leverages data science and machine learning to analyze sectoral financial trends using data from the SEC EDGAR database. This project demonstrates how to transform raw financial data into actionable insights for stakeholders such as investors, policymakers, and business leaders.

Key Features

  • Advanced Data Processing: Automates data curation and storage using Python and MongoDB.
  • Interactive Dashboards: Visualizes insights with Tableau, highlighting trends, profitability, and risks.
  • Machine Learning Models: Implements predictive analysis to forecast market trends and evaluate company performance.
  • Summarization with LLMs: Uses OpenAI GPT models for summarizing complex 10-K filings.

Objectives

  1. Analyze financial data trends across sectors (e.g., Technology, Finance, Healthcare).
  2. Identify correlations between profitability, risk, and research investments.
  3. Present insights through interactive visualizations and dashboards.

Data Source

  • SEC EDGAR Database: Financial filings of publicly traded U.S. companies.

Tools and Technologies

  • Programming Languages: Python
  • Database: MongoDB
  • Visualization: Tableau
  • Libraries: pandas, BeautifulSoup, LangChain
  • APIs: SEC EDGAR API
  • Machine Learning: Random Forest, Statistical Testing

How to Run the Project

  1. Clone the repository:

    git clone https://github.com/madhurlak0810/SEC-edgar.git
  2. Install dependencies:

    pip install -r requirements.txt
  3. Setup MongoDB and OpenAI credentials

  4. Go through qualitative analysis and Project checkpoint:

  5. Visualize the insights with Tableau or view pre-generated dashboards using the tbwx file and excel.

Contributions

  • Madhur Lakshmanan: Data curation, preprocessing, and MongoDB integration and OpenAI integration and Github Pages implementation.
  • Inesh Tandon: Machine learning model design and performance validation.
  • Rohan Jain: Visualization and result analysis.
  • Balamurugan: Visualization and documentation.
  • Abhyansh Anand: Report creation and sectoral analysis.

GitHub Repository: SEC-edgar

Contact

For questions or suggestions, please contact Madhur Lakshmanan.


About

The Financial Data Analytics Platform leverages data science and machine learning to analyze sectoral financial trends using data from the SEC EDGAR database. This project demonstrates how to transform raw financial data into actionable insights for stakeholders such as investors, policymakers, and business leaders.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors