AI-powered e-commerce customer churn prediction web app built with Python, Random Forest and Streamlit
This project predicts whether a customer will churn or not using machine learning. It analyzes customer behavior like app usage, order patterns, and engagement to identify high-risk customers.
Customer churn is a major issue for ecommerce companies. This project helps identify customers who are likely to leave so that businesses can take preventive actions.
- Ecommerce customer dataset (https://www.kaggle.com/datasets/ankitverma2010/ecommerce-customer-churn-analysis-and-prediction)
- Sheet used: E Comm
- Features include:
- Tenure
- Cashback Amount
- Warehouse distance
- Order behavior
- Preferred login device
- Payment mode
- Python
- Pandas and NumPy
- Matplotlib and Seaborn
- Scikit-learn
- Data Loading
- Data Cleaning (handling missing values using median)
- Encoding categorical variables
- Exploratory Data Analysis
- Model Building using Random Forest
- Model Evaluation
- Feature Importance Analysis
- Random Forest Classifier
- Model Accuracy: 98%+
- Confusion Matrix used for performance evaluation
- Classification Report generated
- Customer tenure and engagement significantly impact churn
- Purchase behavior patterns help identify high-risk customers
- Feature importance highlights key drivers of churn
- Built a function to take customer inputs and predict churn
- Helps classify customers into risk categories
- Model saved as churn_model.pkl using pickle
- Can be reused without retraining
- Upload the dataset (Excel file)
- Run the notebook or Python file
- Use prediction function to test new data
ANIKA ATTRI



