This project performs Natural Language Processing (NLP) on Amazon Alexa reviews to classify sentiments using Decision Tree and Random Forest models. It includes data preprocessing, visualization, and model training with Flask integration for deployment.
βοΈ Sentiment analysis of Amazon Alexa reviews
βοΈ Data visualization using Matplotlib & Seaborn
βοΈ Word cloud representation of positive & negative words
βοΈ Preprocessing with stop words removal & stemming
βοΈ Model training using Decision Tree & Random Forest
βοΈ Hyperparameter tuning with GridSearchCV
βοΈ Flask-based web interface for predictions
The dataset contains Amazon Alexa reviews with:
- verified_reviews (Text of the review)
- rating (Star rating of the review)
- feedback (1 for positive, 0 for negative)
- variation (Alexa device variant)
- Distribution of ratings & feedback
- Variation-wise rating analysis
- Word clouds for positive & negative words
- Python
- Flask
- NLTK (Text preprocessing)
- Scikit-learn (Machine Learning)
- Matplotlib & Seaborn (Data Visualization)
- Pandas & NumPy (Data Manipulation)
- Pickle (Model saving)
- Preprocessing: Tokenization, Stopword removal, Stemming
- Feature Extraction: CountVectorizer (Bag of Words)
- Training:
- Decision Tree Classifier
- Random Forest Classifier
- Hyperparameter Tuning: GridSearchCV