Exploring Sales Analytics with SQL and Python

Project Overview

This project demonstrates how to integrate Python and SQL for creating, storing, and analyzing sales data. It involves generating synthetic data, performing SQL queries for actionable insights, and conducting Exploratory Data Analysis (EDA) with Python to reveal patterns and trends through visualizations.

Objectives

Step 1: Dataset Generation

Task 1: Dataset Creation Generate a dataset with the following attributes:
- Customer ID: Unique IDs from 1001 to 1200.
- Customer Name: Random names generated using Faker.
- Product ID: IDs from 1 to 20.
- Purchase Date: Random dates from the last year.
- Quantity: Random values between 1 and 10.
- Price per Unit: Prices ranging from 10.00 to 1000.00.
- Region: Randomly assigned as "North," "South," "East," or "West."
Task 2: Insert Data into SQL
- Define an SQL table schema matching the dataset attributes.
- Populate the SQL database using Python.

Step 2: SQL Queries

Perform the following queries:

Total Sales by Region: Calculate quantity * price_per_unit per region.
Top Products: Retrieve the top 5 products by total sales.
Monthly Sales: Calculate monthly total sales.
Customer Analysis: Find the total amount spent by each customer.
Regional Product Sales: Show product-wise sales for each region.

Step 3: Exploratory Data Analysis (EDA)

Retrieve Data to Python:
- Use libraries like pandas and sqlite3/mysql-connector to load data into a DataFrame.
EDA Tasks:
- Summary Statistics: Compute mean, median, max, min, and standard deviation for quantity and price_per_unit.
- Sales by Region: Summarize total sales for each region.
- Top Customers: Identify the 5 highest-spending customers.
Visualizations:
- Bar Plot: Total sales per region.
- Pie Chart: Proportions of sales by product.
- Line Plot: Monthly sales trends.
- Scatter Plot: Relationship between quantity and price per unit.

Technologies and Libraries

Python: For data generation, analysis, and visualization.
- pandas for data manipulation.
- numpy for numerical operations.
- Faker for synthetic data generation.
- matplotlib and seaborn for plotting.
SQL: SQLite/MySQL for data storage and querying.
- SQLAlchemy for database interaction.

Setup Instructions

Install necessary Python libraries:

pip install pandas numpy faker sqlalchemy pymysql matplotlib seaborn

Run the dataset generation script to populate the SQL database.
Execute the SQL script for queries.
Load the data back into Python for analysis and visualization.

Sample Visualizations

Bar Plot: Total Sales by Region

Line Plot: Monthly Sales Trends

Pie Chart: Sales Proportion by Product

Scatter Plot: Quantity vs. Price Relationship

Insights

Regional Trends: Certain regions dominate sales, providing guidance for resource allocation.
Top Products: High-demand products drive revenue; focus marketing on these items.
Seasonality: Monthly trends highlight peak sales periods.
Pricing Patterns: Analyzing quantity vs. price helps understand customer purchasing behavior.

Deliverables

Python scripts for dataset generation and SQL integration.
SQL script for queries.
Jupyter notebook or Python script for EDA and visualizations.
A detailed README for project setup and usage instructions.

Author

[Manas Jadhav]
For any questions or feedback, feel free to reach out!

Name		Name	Last commit message	Last commit date
Latest commit History 7 Commits
Cust_product.csv		Cust_product.csv
EDA.ipynb		EDA.ipynb
README.md		README.md
SQL queries.sql		SQL queries.sql
main.py		main.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Exploring Sales Analytics with SQL and Python

Project Overview

Objectives

Step 1: Dataset Generation

Step 2: SQL Queries

Step 3: Exploratory Data Analysis (EDA)

Technologies and Libraries

Setup Instructions

Sample Visualizations

Bar Plot: Total Sales by Region

Line Plot: Monthly Sales Trends

Pie Chart: Sales Proportion by Product

Scatter Plot: Quantity vs. Price Relationship

Insights

Deliverables

Author

About

Uh oh!

Releases

Packages

Languages

manasjadhav0086/Python-SQL-Integration-with-EDA

Folders and files

Latest commit

History

Repository files navigation

Exploring Sales Analytics with SQL and Python

Project Overview

Objectives

Step 1: Dataset Generation

Step 2: SQL Queries

Step 3: Exploratory Data Analysis (EDA)

Technologies and Libraries

Setup Instructions

Sample Visualizations

Bar Plot: Total Sales by Region

Line Plot: Monthly Sales Trends

Pie Chart: Sales Proportion by Product

Scatter Plot: Quantity vs. Price Relationship

Insights

Deliverables

Author

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages