https://github.com/sayande01/fake_news_detection_logisticregression

This project detects fake news using Logistic Regression with NLP techniques, including NLTK stopword removal, Porter Stemmer for text normalization, and TF-IDF vectorization for feature extraction. It achieves high accuracy and precision, offering a reliable solution to combat misinformation.
https://github.com/sayande01/fake_news_detection_logisticregression

logistic-regression nltk porter-stemmer stopwords tf-idf-vectorizer

Last synced: about 2 months ago
JSON representation

Host: GitHub
URL: https://github.com/sayande01/fake_news_detection_logisticregression
Owner: sayande01
Created: 2024-11-13T15:43:43.000Z (7 months ago)
Default Branch: main
Last Pushed: 2024-11-15T12:55:55.000Z (7 months ago)
Last Synced: 2025-02-13T02:38:38.244Z (4 months ago)
Topics: logistic-regression, nltk, porter-stemmer, stopwords, tf-idf-vectorizer
Language: Jupyter Notebook
Homepage:
Size: 33.2 KB
Stars: 0
Watchers: 1
Forks: 0
Open Issues: 0
Metadata Files:
- Readme: README.md

Awesome Lists containing this project

README

### **Title**
**"Fake News Detection: A Logistic Regression Approach with NLP Techniques"**

---

### **Description**
This project focuses on detecting fake news using a Logistic Regression model coupled with powerful NLP techniques. The system processes textual data with NLTK's stopword removal, Porter Stemmer for text normalization, and TF-IDF vectorization for feature extraction. Achieving high accuracy and precision, this solution provides a reliable approach to combating misinformation.

Key highlights include:
- **Text Preprocessing**: Includes stopword removal, stemming, and normalization using NLTK's Porter Stemmer.
- **Feature Extraction**: Utilizes **TF-IDF Vectorizer** to convert text into numerical features for model input.
- **Classification Model**: Logistic Regression model ensures efficient and interpretable predictions.
- **Performance Metrics**: Demonstrates high accuracy, recall, and F1-Score, ensuring robust fake news detection.

This system is a comprehensive, data-driven solution designed to help mitigate the spread of fake news effectively.

---

### **Objective**
1. **Develop an NLP-based pipeline** for preprocessing news articles, including:
- Stopword removal using NLTK.
- Text normalization with Porter Stemmer.
- Feature extraction via TF-IDF Vectorizer.
2. **Build and train a Logistic Regression model** to classify news as real or fake.
3. **Achieve high classification performance**, with metrics such as:
- **Accuracy**: > 90%
- **Precision** and **Recall**: > 85%
4. **Provide a scalable and interpretable model** for real-world fake news detection tasks.

---

ecosyste.ms

Data

Tools

Indexes

Applications

Experiments

Awesome

https://github.com/sayande01/fake_news_detection_logisticregression

Awesome Lists containing this project

README