fake-news-detection

Developed an end‑to‑end Fake‑News Detection pipeline that blends rigorous exploratory data analysis with production‑ready machine‑learning practices. I began by profiling class balance and text‑length distributions, using regular expressions, lemmatization, contraction expansion, and stop‑word removal to create a clean corpus. Word‑ and character‑level insights were extracted through unigram, bigram, and trigram frequency analysis, while VADER sentiment scores captured tonal cues often overlooked by traditional features.

For feature engineering, I combined sentiment signals with high‑dimensional TF‑IDF vectors, then trained and tuned both Logistic Regression and SVM classifiers under a stratified five‑fold cross‑validation scheme. The model achieved over 92% F1-score with balanced generalization across classes, and the confusion matrix was instrumental in verifying that the classifier was not overfitting to the majority class or misclassifying borderline examples. Interpretability and bias detection were further enhanced by examining top TF-IDF tokens and analyzing misclassified samples.

Name		Name	Last commit message	Last commit date
Latest commit History 7 Commits
Fake_News_Detection.ipynb		Fake_News_Detection.ipynb
README.md		README.md
news.csv		news.csv

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

fake-news-detection

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

fake-news-detection

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages