An open API service indexing awesome lists of open source software.

https://github.com/sayande01/fake_news_detection_logisticregression

This project detects fake news using Logistic Regression with NLP techniques, including NLTK stopword removal, Porter Stemmer for text normalization, and TF-IDF vectorization for feature extraction. It achieves high accuracy and precision, offering a reliable solution to combat misinformation.
https://github.com/sayande01/fake_news_detection_logisticregression

logistic-regression nltk porter-stemmer stopwords tf-idf-vectorizer

Last synced: about 2 months ago
JSON representation

This project detects fake news using Logistic Regression with NLP techniques, including NLTK stopword removal, Porter Stemmer for text normalization, and TF-IDF vectorization for feature extraction. It achieves high accuracy and precision, offering a reliable solution to combat misinformation.

Awesome Lists containing this project

README

        

### **Title**
**"Fake News Detection: A Logistic Regression Approach with NLP Techniques"**

---

### **Description**
This project focuses on detecting fake news using a Logistic Regression model coupled with powerful NLP techniques. The system processes textual data with NLTK's stopword removal, Porter Stemmer for text normalization, and TF-IDF vectorization for feature extraction. Achieving high accuracy and precision, this solution provides a reliable approach to combating misinformation.

Key highlights include:
- **Text Preprocessing**: Includes stopword removal, stemming, and normalization using NLTK's Porter Stemmer.
- **Feature Extraction**: Utilizes **TF-IDF Vectorizer** to convert text into numerical features for model input.
- **Classification Model**: Logistic Regression model ensures efficient and interpretable predictions.
- **Performance Metrics**: Demonstrates high accuracy, recall, and F1-Score, ensuring robust fake news detection.

This system is a comprehensive, data-driven solution designed to help mitigate the spread of fake news effectively.

---

### **Objective**
1. **Develop an NLP-based pipeline** for preprocessing news articles, including:
- Stopword removal using NLTK.
- Text normalization with Porter Stemmer.
- Feature extraction via TF-IDF Vectorizer.
2. **Build and train a Logistic Regression model** to classify news as real or fake.
3. **Achieve high classification performance**, with metrics such as:
- **Accuracy**: > 90%
- **Precision** and **Recall**: > 85%
4. **Provide a scalable and interpretable model** for real-world fake news detection tasks.

---