An open API service indexing awesome lists of open source software.

https://github.com/kingabzpro/malawi-news-classification

Using text classifier to predict various categories in Malawi News articles using SMOTE and SGDClassifier.
https://github.com/kingabzpro/malawi-news-classification

africa multiclass-classification nlp-machine-learning oversampling

Last synced: about 2 months ago
JSON representation

Using text classifier to predict various categories in Malawi News articles using SMOTE and SGDClassifier.

Awesome Lists containing this project

README

        

# Malawi-News-Classification
Using text classifier to predict various categories in Malawi News articles using SMOTE and SGDClassifier.

[![View in Deepnote](https://deepnote.com/static/buttons/view-in-deepnote.svg)](https://deepnote.com/viewer/github/kingabzpro/Malawi-News-Classification/blob/main/malawi-news-classification.ipynb)

![cover](Malawi-News-Class.jpg)

The project code is simple and effective on competitive grounds. I have experimented with Vectorizer, Porter stemmer for test preprocessing. I have also used multiple methods to clean my text to improved overall model performance. In the end, I have used SKlearn Stochastic Gradient Decent (SGD) classifier for predicting News categories. I have also experimented with various neural networks and gradient boosting models, but they all failed as simple logistics regression with minimum hyperparameter tunning works quite well on this data.

> To understand the code read my article on [Medium](https://pub.towardsai.net/malawi-news-classification-an-nlp-project-adfa867abfd9)