A comprehensive exploratory data analysis (EDA) of 1,000 supermarket transactions, uncovering customer behavior, payment trends, revenue drivers, and branch-level performance.
The dataset includes 1,000 transaction entries with the following attributes:
- Invoice ID
- Branch (A, B, C)
- City (Yangon, Mandalay, Naypyitaw)
- Customer Type (Member / Normal)
- Gender
- Product Line
- Unit Price
- Quantity
- Tax (5%)
- Total
- Date & Time
- Payment Method (Cash, Ewallet, Credit Card)
- COGS
- Gross Income
- Rating
- Loaded CSV using pandas
- Verified shape: 1000 × 17
- Confirmed no missing values
- Converted Date column to datetime
- Produced descriptive statistics using `.describe()`
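The steps above can be sketched roughly as follows. This is a minimal illustration, not the notebook's actual code: the inline sample data, the column names, the `load_and_check` helper, and the `%m/%d/%Y` date format are all assumptions. In practice you would pass the path of the real CSV instead of the in-memory sample.

```python
import io

import pandas as pd

# Tiny synthetic stand-in for the real CSV (assumed column names, made-up values).
SAMPLE_CSV = """Invoice ID,Branch,Date,Total,Rating
101-00-0001,A,1/5/2019,548.97,9.1
101-00-0002,C,3/8/2019,80.22,9.6
101-00-0003,B,2/20/2019,340.53,7.4
"""


def load_and_check(source):
    """Hypothetical helper: load transactions and run the basic checks."""
    df = pd.read_csv(source)
    print("shape:", df.shape)  # (rows, columns)
    print("missing values:", int(df.isna().sum().sum()))
    # Assumed month/day/year format; adjust to match the actual file.
    df["Date"] = pd.to_datetime(df["Date"], format="%m/%d/%Y")
    return df


df = load_and_check(io.StringIO(SAMPLE_CSV))
summary = df.describe()  # descriptive statistics
print(summary)
```

On the full dataset, the shape check would be expected to report 1000 rows and 17 columns, matching the figures listed above.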
Countplot comparing Male vs Female customers.
Histogram showing rating distribution.
Transaction counts for branches A, B, and C.
- Ewallet: 345
- Cash: 344
- Credit Card: 311
Includes a stacked bar chart by branch.
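A table like the one behind the stacked bar chart can be built with a cross-tabulation. The sketch below uses a few made-up rows, and the column names `Branch` and `Payment` are assumptions about the CSV, so treat it as an outline rather than the notebook's exact code.

```python
import pandas as pd

# Synthetic rows for illustration only; column names are assumed.
df = pd.DataFrame({
    "Branch": ["A", "A", "B", "B", "C", "C", "C"],
    "Payment": ["Cash", "Ewallet", "Ewallet", "Credit card",
                "Cash", "Cash", "Ewallet"],
})

# Overall count per payment method.
payment_counts = df["Payment"].value_counts()

# Rows are branches, columns are payment methods, cells are transaction counts.
by_branch = pd.crosstab(df["Branch"], df["Payment"])

print(payment_counts)
print(by_branch)
```

With Matplotlib installed, `by_branch.plot(kind="bar", stacked=True)` would draw the counts as a stacked bar chart, one bar per branch.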
Boxplot comparing customer satisfaction across branches.
- Gross income by branch
- Gross income by gender
- Daily gross income trend
- Branch C showed the highest total gross income
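The three revenue breakdowns listed above are all group-and-sum operations. Here is a hedged sketch on synthetic data. The column names (`Branch`, `Gender`, `Date`, `gross income`) are assumptions, and the numbers are invented, so they say nothing about the real results.

```python
import pandas as pd

# Made-up transactions; column names are assumed, values are illustrative.
df = pd.DataFrame({
    "Branch": ["A", "B", "C", "C"],
    "Gender": ["Female", "Male", "Female", "Male"],
    "Date": pd.to_datetime(
        ["2019-01-05", "2019-01-05", "2019-01-06", "2019-01-06"]
    ),
    "gross income": [10.0, 20.0, 30.0, 5.0],
})

# Total gross income per branch and per gender.
income_by_branch = df.groupby("Branch")["gross income"].sum()
income_by_gender = df.groupby("Gender")["gross income"].sum()

# Daily totals in chronological order, ready for a line plot.
daily_income = df.groupby("Date")["gross income"].sum().sort_index()

# Label of the branch with the largest total.
top_branch = income_by_branch.idxmax()

print(income_by_branch, income_by_gender, daily_income, sep="\n\n")
```

The same `idxmax()` call on the real data is one way to confirm which branch leads in total gross income.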
- Customers are almost evenly split: 501 Female, 499 Male
- Branch C generated the highest gross income
- Ewallet (345 transactions) was marginally ahead of Cash (344), with Credit Card (311) used least
- Ratings were consistently positive across branches
- Fashion Accessories, Electronics, and Health & Beauty were top revenue-generating product lines
- Transactions by female customers generated a slightly higher average gross income
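Insights such as the product-line ranking and the average gross income by gender can be checked with a couple of aggregations. The sketch below assumes column names (`Product line`, `Gender`, `Total`, `gross income`) and uses invented values, so its output does not reproduce the findings above.

```python
import pandas as pd

# Synthetic rows; column names are assumed and the values are made up.
df = pd.DataFrame({
    "Product line": ["Electronics", "Electronics",
                     "Health & Beauty", "Fashion Accessories"],
    "Gender": ["Female", "Male", "Female", "Male"],
    "Total": [100.0, 50.0, 120.0, 80.0],
    "gross income": [5.0, 2.5, 6.0, 4.0],
})

# Revenue per product line, largest first.
revenue_by_line = (
    df.groupby("Product line")["Total"].sum().sort_values(ascending=False)
)

# Mean gross income per transaction, split by gender.
avg_income_by_gender = df.groupby("Gender")["gross income"].mean()

print(revenue_by_line)
print(avg_income_by_gender)
```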
- Python
- Pandas
- NumPy
- Matplotlib
- Seaborn
- Jupyter Notebook
```bash
# Clone the repository
git clone https://github.com/yourusername/SuperMarketAnalysis.git
# Install the dependencies
pip install pandas numpy matplotlib seaborn
# Open the notebook
jupyter notebook SuperMarket.ipynb
```
- Add predictive modeling (sales forecasting)
- Segment customers via clustering
- Build a dashboard using Power BI, Streamlit, or Plotly
- Forecast revenue by product line
Pull requests are welcome. Feel free to submit improvements, visualizations, or new analyses.
If you found this project useful, consider giving the repository a star.