This project is a powerful News Topic Classifier that automatically categorizes news articles into one of four topics: π World , π Sports , πΌ Business and π¬ Science/Technology
Built using Natural Language Processing (NLP) techniques and a Logistic Regression classifier (achieving 91% accuracy), the project features a modern, interactive web app built and deployed with Streamlit Community Cloud.
π€ Automatic Topic Classification using Logistic Regression and TF-IDF
π» Interactive Web App: User-friendly, visually appealing, and easy to use
π Model Confidence Visualization: See prediction probabilities for each topic
π Text Analytics: Get word count, unique words, top keywords, and more
π§Ή Preprocessing Transparency: View how your text is cleaned before prediction
π Sample Inputs: Try out the app with real-world news examples
Link : https://new-article-classification-using-nlp-nrsrc3pxheagvv2vwsxqvd.streamlit.app/
π₯οΈ Try the App on Streamlit Cloud!
βββ app.py # Streamlit web app
βββ log_reg_model.pkl # Trained Logistic Regression model
βββ tfidf_vectorizer.pkl # TF-IDF vectorizer
βββ requirements.txt # Project dependencies
βββ README.md # This file
βββ News_Classification.ipynb # Jupyter notebook
Classifier: Logistic Regression
Feature Extraction: TF-IDF
Test Accuracy: 91%
Classes: World, Sports, Business, Science/Technology
π€ Built a robust Logistic Regression classifier with 91% accuracy for news topic classification.
π» Developed an interactive Streamlit web app for easy, real-time predictions.
π Included model confidence visualization and text analytics for deeper insights.
π Deployed the app on Streamlit Community Cloud for public access and demonstration.