Skip to content

Shiva-Prasad-Naroju/News-Article-Classification-Using-NLP

Folders and files

NameName
Last commit message
Last commit date

Latest commit

Β 

History

4 Commits
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 

Repository files navigation

πŸ“° News Topic Classifier & Text Analyzer


This project is a powerful News Topic Classifier that automatically categorizes news articles into one of four topics: 🌎 World , πŸ… Sports , πŸ’Ό Business and πŸ”¬ Science/Technology

Built using Natural Language Processing (NLP) techniques and a Logistic Regression classifier (achieving 91% accuracy), the project features a modern, interactive web app built and deployed with Streamlit Community Cloud.

✨ Features:

πŸ€– Automatic Topic Classification using Logistic Regression and TF-IDF

πŸ’» Interactive Web App: User-friendly, visually appealing, and easy to use

πŸ“Š Model Confidence Visualization: See prediction probabilities for each topic

πŸ“ˆ Text Analytics: Get word count, unique words, top keywords, and more

🧹 Preprocessing Transparency: View how your text is cleaned before prediction

πŸ“ Sample Inputs: Try out the app with real-world news examples

🌐 Live Demo:

Link : https://new-article-classification-using-nlp-nrsrc3pxheagvv2vwsxqvd.streamlit.app/

πŸ–₯️ Try the App on Streamlit Cloud!

πŸ—οΈ Project Structure:

β”œβ”€β”€ app.py # Streamlit web app

β”œβ”€β”€ log_reg_model.pkl # Trained Logistic Regression model

β”œβ”€β”€ tfidf_vectorizer.pkl # TF-IDF vectorizer

β”œβ”€β”€ requirements.txt # Project dependencies

β”œβ”€β”€ README.md # This file

└── News_Classification.ipynb # Jupyter notebook

πŸ“Š Model Performance:

Classifier: Logistic Regression

Feature Extraction: TF-IDF

Test Accuracy: 91%

Classes: World, Sports, Business, Science/Technology

🏁 Conclusion:

πŸ€– Built a robust Logistic Regression classifier with 91% accuracy for news topic classification.

πŸ’» Developed an interactive Streamlit web app for easy, real-time predictions.

πŸ“Š Included model confidence visualization and text analytics for deeper insights.

πŸš€ Deployed the app on Streamlit Community Cloud for public access and demonstration.

About

πŸš€ Built a News Topic Classifier using NLP techniques with Logistic Regression and TF-IDF πŸ’»βœ¨ Developed and deployed an interactive, user-friendly Streamlit web app for real-time πŸ“° news classification and πŸ“Š text analytics

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors